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Preface 


Many breakthroughs in research and, more generally, solutions to problems come as 
the result of someone making connections. These connections are sometimes quite 
subtle, and at first blush, they may not appear to be plausible candidates for part 
of the solution to a difficult problem. In this book, we think of these connections 
as bridges. A bridge enables the possibility of a solution to a problem that may 
have a very elementary statement but whose solution may involve more complicated 
realms that may not be directly indicated by the problem statement. Bridges extend 
and build on existing ideas and provide new knowledge and strategies for the solver. 
The ideal audience for this book consists of ambitious students who are seeking 
useful tools and strategies for solving difficult problems (many of olympiad caliber), 
primarily in the areas of real analysis and linear algebra. 

The opening chapter (aptly called “Chapter 1”) explores the metaphor of bridges 
by presenting a myriad of problems that span a diverse set of mathematical 
fields. In subsequent chapters, it is left to the reader to decide what constitutes 
a bridge. Indeed, different people may well have different opinions of whether 
something is a (useful) bridge or not. Each chapter is composed of three parts: the 
theoretical discussion, proposed problems, and solutions to the proposed problems. 
In each chapter, the theoretical discussion sets the stage for at least one bridge 
by introducing and motivating the themes of that chapter—often with a review of 
some definitions and proofs of classical results. The remainder of the theoretical 
part of each chapter (and indeed the majority) is devoted to examining illustrative 
examples—that is, several problems are presented, each followed by at least one 
solution. It is assumed that the reader is intimately familiar with real analysis and 
linear algebra, including their theoretical developments. There is also a chapter 
that assumes a detailed knowledge of abstract algebra, specifically, group theory. 
However, for the not so familiar with higher mathematics reader, we recommend a 
few books in the bibliography that will surely help, like [5—9, 11, 12]. 

Bridges can be found everywhere—and not just in mathematics. One such final 
bridge is from us to our friends who carefully read the manuscript and made 
extremely valuable comments that helped us a lot throughout the making of the 
book. It is, of course, a bridge of acknowledgments and thanks; so, last but not least, 


v 
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we must say that we are deeply grateful to Gabriel Dospinescu and Chris Jewell for 
all their help along the way to the final form of our work. 

In closing, as you read this book, we invite you to discover some of these bridges 
and embrace their power in solving challenging problems. 


Richardson, TX, USA Titu Andreescu 
Targoviste, Romania Cristinel Mortici 
Barlad, Romania Marian Tetiva 
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Chapter 1 
Mathematical (and Other) Bridges 


Many people who read this book will probably be familiar with the following result 
(very folkloric, if we may say so). 


Problem 1. The midpoints of the bases of a trapezoid, the point at which its 
lateral sides meet, and the point of intersection of its diagonals are four collinear 
points. 


E 


A M'=M B 


Solution. Indeed, let ABCD be a trapezoid with AB || CD; let M and N be the 
midpoints of the line segments AB and CD, respectively; and let {E} = AD N BC 
and {F} = AC N BD. We intend to prove that M, N, E, and F are four collinear 
points. 

Denote by M’ the intersection of EF with AB. By Ceva’s theorem, we have 


AM’ BC ED | 
M'B CE DA | 
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Also, Thales’ interception theorem says that 


DE CE CB DE 


DA CB” CE DA ” 


and by putting together the above two equations, we get M'A = M'’B, that is, M’ is 
actually the midpoint of AB; therefore, M = M’ belongs to EF, which is (part of) 
what we intended to prove. O 


The reader will definitely find a similar way (or will be able to use the already 
proved fact about the collinearity of M, E, and F) to show that N, E, and F are also 
collinear. 

One can also prove that a converse of this theorem is valid, that is, for instance, if 
M, E, and F are collinear, then AB and CD are parallel (just proceed analogously, but 
going in the opposite direction). Or try to prove that if the midpoints of two opposite 
sides of a trapezoid and the intersection point of its diagonals are three collinear 
points, then the quadrilateral is actually a trapezoid (the sides whose midpoints we 
are talking about are the parallel sides); this could be more challenging to prove. 

As we said, this is a well-known theorem in elementary Euclidean geometry, 
so why bother to mention it here? Well, this is because we find in it a very good 
example of a problem that needs a (mathematical) bridge. Namely, you noticed that 
the problem statement is very easy to understand even for a person who only has a 
very humble background in geometry—but that person wouldn’t be able to solve the 
problem. You could be familiar with basic notions as collinearity and parallelism, 
you could know such things as properties of angles determined by two parallel lines 
and a transversal, but any attempt to solve the problem with such tools will fail. One 
needs much more in order to achieve such a goal, namely, one needs a new theory— 
we are talking about the theory of similarity. In other words, if you want to solve 
this problem, you have to raise your knowledge to new facts that are not mentioned 
in its statement. You need to throw a bridge from the narrow realm where you are 
stuck to a larger extent. 

The following problem illustrates the same situation. 


Problem 2. Determine all monotone functions f : N* —> R such that 


Flay) = f(x) + fO) for all x, y € N* 


(N* denotes the set of positive integers, while R denotes the reals). 


Solution. All that one can get from the given relation satisfied by f is f(1) = 0 
and the obvious generalization f(x, ---x,) = f(x1) + -+ + f(x) for all positive 
integers x;,...,X, (an easy and canonical induction leads to this formula) with its 
corollary f(x") = kf(x) for all positive integers x and k. But nothing else can be 
done if you don’t step into a higher domain (mathematical analysis, in this case) 
and if you don’t come up with an idea. The idea is possible in that superior domain, 
being somehow natural if you want to use limits. 
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Let n be a positive integer (arbitrary, but fixed for the moment), and let us 
consider, for any positive integer k, the unique nonnegative integer ną such that 
2" < nk < 2%+!, Rewriting these inequalities in the form 


Inn 1 n lnn 
ln2 k k` In2 
one sees immediately that jim ng/k = lnn/ln2. 
00 
Because if f is increasing, —f is decreasing and satisfies the same functional 
equation (and conversely), we can assume, without loss of generality, that f is 


increasing. Then from the inequalities satisfied by the numbers ng and by applying 
the noticed property of f, we obtain 


1 
JÆ < sol) sS & Ho <f s (447) sO, 
Now we can let k go to infinity, yielding 
Inn 
fn) =f 


for any positive integer n. So, all solutions are given by a formula of type f(n) = 
alnn, for a fixed real constant a. If f is strictly increasing (or strictly decreasing), 
we get f(2) > f(1) = 0 (respectively, f(2) < f(1) = 0); thus f(2) # 0, and, 
with b = 2!/f2), the formula becomes f(n) = log, n (with greater, respectively 
lesser than 1 base b of the logarithm according to whether f is strictly increasing, or 
strictly decreasing). The null function (f(n) = 0 for all n) can be considered among 
the solutions, if we do not ask only for strictly monotonic functions. By the way, if 
we drop the monotonicity condition, we can find numerous examples of functions 
that only satisfy the first condition. For instance, define f (n) = a, +a) +- - -+ a, for 
n = pi' --- pt, with pi,..., px distinct primes and a1, .. . , ag positive integers and, 
of course, f(1) = 0, and we have a function with property f (mn) = f(m) + f(n) 
for all positive integers m and n. The interested reader can verify for himself this 
condition and the fact that f is not of the form f(n) = log,n, for some positive 
b # 1 (or, equivalently, that this function is not monotone). C 


Again, one sees that in order to solve such a problem, one needs to build a bridge 
between the very elementary statement of the problem and the much more involved 
realm of mathematical analysis, where the problem can be solved. 

However, there is more about this problem for the authors of this book. Namely, 
it also demonstrates another kind of bridge—a bridge over the troubled water of 
time, a bridge connecting moments of our lives. As youngsters are preoccupied 
by mathematics, we had (behind the Iron Curtain, during the Cold War) very 
few sources of information and very few periodicals to work with. There were, 
say, in the 80s of the former century, Gazeta Matematică and Revista matematică 
din Timişoara—only two mathematical magazines. The first one was a monthly 
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magazine founded long ago, in 1895, by a few enthusiastic mathematicians and 
engineers among which Gheorghe Titeica is most widely known. The second 
magazine used to appear twice a year and was much younger than its sister, but also 
had a national spreading due again to some enthusiastic editors. Anyway, this is all 
we had, and with some effort, we could also get access to Russian magazines such 
as Kvant or Matematika v Skole, or the Bulgarian Matematika. Two of the authors of 
this book were at the time acquainted with problem 2 through Revista Matematică 
din Timisoara. They were high school students at that time and thoroughly followed 
up the problem column of this magazine, especially a “selected problems” column 
where they first met this problem (and couldn’t solve it). The third author was the 
editor of that column—guess who is who! Anyway, for all three of us, a large 
amount of the problems in this book represent as many (nostalgic) bridges between 
past and present. Problem 2 is one of them, and we have many more, from which a 
few examples are presented below. 


Problem 3. Are there continuous functions f : R — R such that 
SQ) + f(x) + x = 0 for every real x? 


Solution. No, there is no such function. The first observation is that if a function 
with the stated properties existed, then it would be strictly monotone. This is because 
such a function must be injective (the reader will immediately check that f(x1) = 
f (x2) implies f(f(x1)) = f(f(x2)); therefore, by the given equation, xı = x2). Now, 
injectivity and continuity together imply strict monotonicity; so if such a function 
existed, it would be either strictly increasing or strictly decreasing. 

However, if f is strictly increasing, then f o f and f of +f + 1p are also strictly 
increasing, which is impossible, because f o f +f + lg must equal the identically 0 
function (by 1 we mean the identity function of the reals defined by 1g (x) = x for 
every real x). On the other hand, by replacing x with f(x) in the given equation, we 
get fFF) +f£F@)) + f(x) = 0 for all x, and subtracting the original equation 
from this one yields f(f(f(x))) = x for all x or f of of = 1p. This equality is a 
contradiction when f (and f o f o f also) is strictly decreasing and the solution ends 
here. O 


By the way, note that if a,, d,—-1, ..., ao are real numbers such that the equation 
aX” + Gy—)x" | +--+ -+a = 0 has no real solutions, then there exists no continuous 
function f : R > R such that af"! + a-f"! + -+ + agf™ = 0. Here fl is 
the nth iterate of f (with fl! = 1,), and 0 represents the identically 0 function. This 
was a (pretty challenging at the time) problem that we had on a test in the mentioned 
above eighties, on a preparation camp. A bridge, isn’t it? 


Problem 4. Prove that among any 79 consecutive positive integers, there exists 
at least one such that the sum of its digits is divisible by 13. Find the smallest 78 
consecutive positive integers such that none of them has its sum of digits divisible 
by 13. 
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Solution. By S(N) we will denote the sum of digits of the natural number N. We 
can always find among 79 consecutive natural numbers 40 of the form 100k + a0, 
100k + al,..., 100k + (a + 3)9, with k a natural number and a < 6 a digit. Among 
the sums of digits of these numbers, there are S(k) +a, S(k) +a+1,...,S(k) +a+12, 
that is, there are 13 consecutive natural numbers, one of which has to be divisible 
by 13. 

Now, for the second part, we have to choose the desired numbers in such a 
way that no forty of them starting with a multiple of 10 are in a segment of 
natural numbers of the form {100k, 100k + 1,...,100k + 99}. This can only 
happen if the numbers are of the form 100a — 39, 100a — 38,...,100a + 38, 
for some natural number a. Actually we will consider numbers of the form 
10° — 39, 10° — 38,..., 10° + 38, with b > 2, because it will be important how 
many nines there are before the last two digits. The sums of digits of the numbers 
10°,10? + 1,..., 10 + 38 will cover all possibilities from 1 to 12. The sums of 
digits of the numbers 10? —39, 10° —38,..., 10? — 1 will range from 9(b— 2) +7 to 
9(b—2)+ 18, and it is necessary that they cover exactly the same remainders modulo 
13 (from 1 to 12). For this to happen, we need to have 9(b — 2) +7 = 1 (mod 13), 
which gives b = 10 (mod 13). So, the smallest possible 78 such numbers are 
those obtained for b = 10, thus the (78 consecutive) numbers from 9999999961 
to 10000000038. O 


This is a problem that we know from the good old RMT. 


Problem 5 (Erdős-Ginzburg-Ziv theorem). Prove that among any 2n — 1 
integers, one can find n with their sum divisible by n. 


Solution. This is an important theorem, and it opened many new approaches in 
combinatorics, number theory, and group theory (and other branches of mathemat- 
ics) in the middle of the twentieth century (it has been proven in 1961). However, 
we first met it in Kvant, with no name attached, and it was also Kvant that informed 
us about the original proof. Seemingly the problem looks like that (very known) 
one which states that from any n integers, one can choose a few with their sum 
divisible by n. The solution goes like this. If the numbers are a1, ..., an, consider 
the n numbers aj, ad, +a2,..., a1 +42 +: +a. If there is any of them divisible with 
n, the solution ends; otherwise, they are n numbers leaving, when divided by n, only 
n—1 possible remainders (the nonzero ones); therefore, by the pigeonhole principle, 
there are two of them, say a; +--+ + a; and a, + -++ + aj, with, say, i < j that are 
congruent modulo n. Then their difference a;+; + ++ + a; is, of course, divisible by 
n (and is a sum of a few of the initial numbers). We put here this solution (otherwise, 
we are sure that it is well-known by our readers) only to see that there is no way to 
use its idea for solving problem 5 (which is a much deeper theorem). Indeed, the 
above solution allows no control on the number of elements in the sum that results 
to be divisible by n; hence, it is of no use for problem 5. The proof that we present 
now (actually the original proof of the three mathematicians) is very ingenious and, 
of course, builds a bridge. 
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The first useful observation is that the property from the theorem is multiplica- 
tive, that is, if we name it P(n), we can prove that P(a) and P(b) together imply 
P(ab). This permits an important reduction of the problem to the case of prime n 
(and it is used in all the proofs that we know). We leave this as an (easy and nice) 
exercise for the reader. So, further, we only want to prove (and it suffices, too) that 
from any 2p — 1 integers one can always choose p with their sum divisible by p, 
where p is a positive prime. 

The bridge we throw is towards the following: 


Theorem. Let A and B be subsets of Zp, with p prime, and let 
A+B={a+b|aeA, be B. 


Then we have |A + B| > min{p, |A| + |B| — 1}. (By |X|, we mean the number of 
elements of the set X.) 


We skip the proof of this (important) theorem named after Cauchy and Davenport 
(the second rediscovered it a century after the first one; each of them needed it in 
his research on other great mathematical results), but we insist on the following: 


Corollary. Let A,,...,As be 2-element subsets of Zp. Then 
|A + + A| > min{p, s + 1}. 
In particular, if A,,...,Ap—1 are subsets with two elements of Zp, then 
Ai +++: +Ap-1 = Zp; 


that is, every element from Z, can be realized as a sum of elements from 
Aj,...,Ap—1 (one in each set). 


This corollary is all one needs to prove Erdés-Ginzburg-Ziv’s theorem, and it can 
be demonstrated by a simple induction over s. The base case s = 1 being evident, 
let’s assume that the result holds for s two-element subsets of Z, and prove it for 
s + 1 such subsets Aj,...,A;41. If s + 1 > p, we have nothing to prove; hence, 
we may assume that the opposite inequality holds. In this case, by the induction 
hypothesis, there are at least s + 1 distinct elements x;,...,%;41 in Ay +--+: + As. 
Let A,41 = {y, z}; then the set A, +---+A,+A,+, surely contains xı +y, . . . , Xs+1 +Y 
and xı +z, . . . , Xs+1 +Z. But the sets {x1 +y, . . . , Xs+1 +y} and {x1 +z, . . . , Xs+1 +Z} 
cannot be equal, because in that case, we would have 


Ca +y) +: + spr +Y) = a +2) H Esti +z), 


which means (s + l)y = (s+ 1)z. As 1 < s+ 1 < p -— l, this implies y = z 
in Zp, which is impossible (because y and z are the two distinct elements of As+1). 
Consequently, among the elements x; + y,...,%s+1 + y and xı + Z,...,Xs41 +Z 
of Aj +--+ + As41, there are at least s + 2 mutually distinct elements, finishing the 
proof. 
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@p—1 to be the remainders of the given 2p — | integers when divided by p, in 
increasing order. If, for example, a; = ap, then aj = ay = ... = ap, and the 
sum of the p numbers that leave the remainders a1,...,ap is certainly divisible 
by p; similarly, the problem is solved when any equality a; = aj+p—; holds (for 
any 1 < j < p). Thus we can assume further that (for every 1 < j < p) aj and 
dj+p—1 are distinct. Now we can consider the two-element subsets of Z, defined by 
Aj = (qj, 4j+p-1}, 1 < j < p — 1. (We do not use a special notation for the residue 
class modulo p of the number x, which is also denoted by x.) 

According to the above corollary of the Cauchy-Davenport theorem, A; + ++- + 
Ap—1 has at least p elements; therefore, it covers all Z,. Consequently, there exist 
ij,..., 4-1 Such that ij is either j or j + p — 1 for any j € {1,...,p — 1} and 
ai +--+ + di, = —rp-1 in Zp. This means that the sum aj, +--+ + Gi, + Grp-1 
(where, clearly, all indices are different) is divisible by p, that is, the sum of the 
corresponding initial numbers is divisible by p, finishing the proof. O 


Now for the proof of Erdés-Ginzburg-Ziv theorem, consider aq; < a2 < ... < 


One can observe that the same argument applies to prove the stronger assertion 
that among any 2p — | given integers, there exist p with their sum giving any 
remainder we want when divided by p. Also, note that the numbers 0,...,0,1,...,1 
(n — | zeros and n — 1 ones) are 2n — 2 integers among which one cannot find any 
n with their sum divisible by n (this time n needs not be a prime). Thus, the number 
2n— 1 from the statement of the theorem is minimal with respect to n and the stated 
property. 

There are now many proofs of this celebrated theorem, each and every one 
bringing its amount of beauty and cleverness. For instance, one of them uses the 
congruence 


XO iy teet = 0 (mod p) 


l<i <-<ip<2p—-1 


(the sum is over all possible choices of a subset of p elements of the set 
{1,...,2p— 1}; in other words, it contains all sums of p numbers among the 2p — 1 
given integers, which we denoted by x), .. . , X2p—1). Knowing this congruence and 
Fermat’s Theorem, one gets N = 0 (mod p), where N means the number of those 
sums of p of the given 2p — 1 integers that are not divisible by p. However, if all 


p-1 
(mod p), a contradiction—hence there must exist at least one sum of p numbers that 
is divisible by p. 
This proof is somehow simpler than the previous one, but it relies on the above 
congruence, which, in turn, can be obtained from the general identity 


2p —1 
the possible sums weren’t divisible by p, we would have N = ( g ) = 1 


k 
D cir (ys)' =o 
m} 


ieS 
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valid for all elements x1, . . . , Xm of a commutative ring and for any 1 < k < m—1. 
For k = m, we need to replace the 0 from the right hand side with m!xı ...Xm, 
and one can find results for the corresponding sum obtained by letting k = m + 1, 
k = m + 2, and so on, but this is not interesting for us here. Let us only remark 
how another bridge (a connection between this identity and the Erdés-Ginzburg-Ziv 
theorem) appeared, seemingly out of the blue. The reader can prove the identity for 
himself (or herself) and use it then for every group of p of the given 2p — 1 integers, 
with exponent p — 1, and then add all the yielded equalities; then try to figure out 
(it is not hard at all) how these manipulations lead to the desired congruence and, 
finally, to the second (very compact) proof of the Erdés-Ginzburg-Ziv’s theorem. 
However, we needed a bridge. What this book tries to say is that there are bridges 
everywhere (in mathematics and in the real life). At least nostalgic bridges, if none 
other are evident. 

Let us see now a few more problems whose solutions we’ll provide after the 
reader has already tried (a bit or more) to solve independently. As the whole book, 
the collection is eclectic and very subjective—and it is based on the good old 
sources from our youth, such as Gazeta Matematică (GM), Revista matematică din 
Timisoara (RMT), Kvant, the Romanian olympiad or TSTs, and so on. Most of the 
problems are folklore (and their solutions, too), but they first came to us from these 
sources. When the problems have proposers we mention them; otherwise, as they 
can be found in many books and magazines, we avoid any references—every reader, 
we are sure, knows where to find them. 


Proposed Problems 


1. (Mihai Baluna, RMT) Find all positive integers n such that any permutation of 
the digits of n (in base ten) produces a perfect square. 

2. Let a,,...,a@, be real numbers situated on a circumference and having zero 
sum. Prove that there exists an index i such that the n sums aj, a; + dj+41,..-, 
ai + ain, + +++ + Gj4n—1 are all nonnegative. Here, all indices are considered 
modulo n. 

3. Prove that there exist integers a, b, and c, not all zero and with absolute values 
less than one million, such that |a + bV2 + cV3] < 107". 

4. Prove that, for any positive integer k, there exist k consecutive natural numbers 
such that each of them is not square-free. 

5. Find the largest possible side of an equilateral triangle with vertices within a 
unit square. (The vertices can be inside the square or on its boundary.) 

6. Let A and B be square matrices of the same order such that AB— BA = A. Prove 
that A"B — BA” = mA” for all m € N* and that A is nilpotent. 

7. (Dorel Mihet, RMT) Prove that from the set {1*,2*,3*,...} of the powers 
with exponent k € N* of the positive integers, one cannot extract an infinite 
arithmetic progression. 


Solutions 9 


8. 


10. 


Let 1,4, 8, 9, 16,27,32,... be the sequence of the powers of natural numbers 
with exponent at least 2. Prove that there are arbitrarily long (nonconstant) 
arithmetic progressions with terms from this sequence, but one cannot find such 
a progression that is infinite. 


. (Vasile Postolica, RMT) Let (an)n>1 be a convergent increasing sequence. Prove 


that the sequence with general term 


(an+1 — an) (an+1 — Gn—1) «+ (Gn41 — 41) 


is convergent, and find its limit. 
What can we say if we only know that (a,),>1 is increasing? 
Let f be a continuous real function defined on [0, 00) such that lim f(nt) = 0 
n—>Co 


for every t in a given open interval (p, q) (0 < p < q). Prove that lim f(x) = 0. 
x00 


11. (Mihai Onucu Drimbe, GM) Find all continuous functions f : R —> R such that 
FAEFY+D +A +O) SIO =fOE+y+fE+D+fOt+7 
for all x,y,z € R. 
12. (Dorel Mihet, RMT) Let f : [a, b] — [a, b] (where a < b are real numbers) be a 
differentiable function for which f(a) = b and f(b) = a. Prove that there exist 
C1, C2 € (a,b) such that f’(cy)f’(c2) = 1. 
13. Evaluate 
m/2 1 
— dx. 
Í 1 + (tan xv? 
14. Show that 
: ? xX” _ 
lim | (14+=) da= b-a 
n> Ja n 
for all real numbers a and b. 
Solutions 
1. Only the one-digit squares (that is, 1, 4, and 9) have (evidently) this property. 


Suppose a number with at least two digits has the property. It is well known 
that a number with at least two digits and for which all digits are equal cannot 
be a square; therefore, there must be at least two distinct digits, say a and b, 
witha < b. Then if ...ab = k? and |.. ba = Ê, we clearly have k < L, hence 
l> k+ 1, and 


2k+ 1 = (k+1} -kK <P -k = %b-«) < 81. 
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So k < 40 and a direct (boring, but simple) inspection show that no perfect 
square (with at least two digits) until 40? = 1600 has the required property. 


. Solution I. Among all sums of the form a; + aj4) +--+: + aj+x—-1 (for different 


integers j € {1,...,n} and k > 1), there must be a minimal one. Because 
any cyclic permutation of the numbers affects neither the hypothesis nor the 
conclusion of the problem, we can assume that this minimal sum is a; + +--+ 
a;—,. We claim that in this case, a; + ai+1 +--+ + @j4x4—-1 = 0 for all k. Indeed, 
as long as i+ k—1 <n, this means that a; + -++ + ai+k-1 = a1 +++ + Gj-1. 
When i + k — 1 exceeds n, we use the hypothesis a; + --- + a, = 0 to see that 
ait dii +--+ +aj4x-1 = Ois equivalent tO an+ +e +aj4n-1 Z A, +++ + aj}. 


Solution II. We proceed by contradiction and suppose that for every i € 
{1,...,n}, there exists some j > 1 such that a; + -++ + ai+j—1 < 0. So we find a 
sequence 0 = ig < ij < +++ < i, with property Ay = ai+1 +: + diy, < 0 for 
all s € {0,1,...,}. By the pigeonhole principle, there are j and k, with j < k, 
such that i; = i, (mod n), and in that case, Aj + --- + Ag_; < 0 represents a 
contradiction, because A; + --- + Ag—ı is a multiple of a; + -+- + an; therefore 
it is, in fact, 0. 


. There exist 10!8 numbers of the form x + y/2 + 2/3, with x, y, and z integers 


from the set {0, 1...., 10° — 1}. Any two such numbers are distinct (we discuss 
this a little bit later), and any such number is at most equal to (10°—1)(1+ /2+ 
/3) < 5- 106. Divide the interval [0, 5 - 10°) into 10!8 — 1 disjoint subintervals 
with equal lengths 5-10°/(10!8—1), and observe that there must be two numbers 
in the same interval; thus, there exist two of these numbers having the absolute 
value of their difference less than 5-10°/(10!8 —1) < 107!!. If the numbers are 
xı + yı V2 + zı V3 and x2 + y2 V2 + V3, we get |a + bV2 + cV3| < 1071! 
for a = xı — x2, b = yı — y2, and c = zı — Z2, Which are integers (at least one 
of them being nonzero) with absolute values less than one million. 

There are two more issues about this problem that we want (and have) to 
discuss. One of them is necessary to complete the proof, namely, we still need to 
show that two numbers of the form x + y/2+ zV/3 can only be equal whenever 
the corresponding coefficients x, y, and z are equal. More specifically, if xı + 
yV2 + zı v3 = xX + y2 v2 + z243, and x1, Y1, Z1, X2, Y2, Z2 are integers, then 
Xx = X, yı = yo, and zı = z2. Equivalently, if x + yV2 + 2/3 = 0, with 
integers x, y, and z, then x = y = z = 0. (We need this fact at the beginning 
of the above proof, when we number the numbers of the form x + yV2 + zV3, 
with 0 < x,y,z < 10!8 — 1: they have to be mutually distinct when they differ 
by at least a component. We also need it in the last step of the proof, in order to 
show that at least one of the obtained a, b, and c is nonzero.) 

Actually, one can prove a more general statement, namely, that if x, y, z, t, 
are rational numbers and x + yV/2 +273 + tv6 = 0, then x = y=z=t=0. 
Indeed, we have z + t/2 = 0 if and only if z = t = 0 (this is well-known 
and actually is another way to state the irrationality of /2). So, if we have 
z + tv2 = 0, the original equation yields x + y/2 = 0, too, and the desired 
conclusion easily follows. Otherwise, we can rearrange the equation as 
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xtyV/2 


ER E 


=ut+ovv2, 


where u and v are rational numbers. But this implies 3 = u? + 2v? + 2uv V2, 
hence, by invoking the irrationality of /2 again, we get 2uv = 0 and u? + 
2v? = 3, which, as the reader can check, immediately leads to contradiction. 

Actually a much more general statement is valid, namely, that a, %/b, + 
+ dy a/Dy = 0, with a1, . . . , am, b1, .--, bm E€ Q (bi, ..., bm being nonzero) 
implies a} = --- = am = 0 whenever there do not exist i # j among the 
numbers 1,...,m such that b;/b; is an nth power of a rational number. It is 
the so-called linear independence of radicals, a rather folkloric result. When 
m = 2, one can give a similar proof to the one above; the reader can try to see 
that, after a few reductions, it is enough to prove the following: if pı, ..., py 
are positive primes and a; are rational numbers indexed after the subsets J of 
{1,..., N} such that 


then (at least) 2 — 1 of the coefficients a; are null. (The product corresponding 
to the empty set is 1.) We arrived pretty far away from the initial point, didn’t 
we? 

The second issue doesn’t belong to this proof—it is more like a reminder. 
What we want to say is that this proof looks very similar to the proof of 
Dirichlet’s approximation theorem. This theorem says that, given the real 
numbers aj,...,ag and given € > O, there exist integers n and m,..., mx 
such that |na; — m;| < € for all 1 < i < k. Indeed, let us consider some positive 
integer N satisfying N > 1/e and look at the intervals 


pelos palt- pala 
1 Np le N N s... IN Z N ’ 


that are partitioning [0, 1). Consider also the k-tuples (b® ee ye) defined by 
bY = sif and only if {ja;} € I, for 1 <i < kand 1 < j <N*+1. Each b® can 
only take the N values from the set {1,...,N}; hence, the N% + 1 k-tuples can 
have at most N* values. By the pigeonhole principle, there are two of them, say 
(On sisted ,b® ) and (b? ius ,b® ) that are identical. This means that {ja;} and 
{la;} belong to the same interval J, (with s depending on i) for each and every i 
from | to k, further yielding 


[ja -tadl < $ <e 
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for all 1 < i < k, or 


|G — Dai — (Yjai] — [la:i])| < € 


for all i. It is thus enough to choose n = j—/ and m; = [ja;]— [la;] for 1 < i < k. 
It is more likely that problem 3 (and many similar ones) was created 
following the idea of Dirichlet’s approximation theorem, but for us, the order 
was reversed: we first met the problem and only later found out about Dirichlet’s 
theorem. A bridge can be crossed in both directions. 
This is a very easy problem for someone who knows the Chinese remainder 
theorem. Namely, one can pick some k distinct primes, say pı ,...,px, then, by 
the mentioned result, one can conclude that there exists a positive integer x that 
solves the system of congruences x = —j (mod Pj), 1 <j < k; the numbers 
x + 1,..., x + k are not square-free and the problem is solved. One can prove 
in the same way that there exist k consecutive positive integers that are not mth 
power-free (each of which is divisible by a power of a prime with exponent 
at least m), or one can prove that there exist k consecutive positive integers 
each of which is not representable as the sum of two squares (this one is a bit 
more elaborate, but the reader will find her/his way in order to solve it; the only 
necessary result is the one stating that a positive integer cannot be represented 
as the sum of two squares whenever there is a prime congruent to 3 modulo 4 
such that its exponent in the factorization of that integer is odd). 

The original proof (from RMT, many years ago) makes no use of Chinese 
remainder theorem, but rather of a rudiment of it. Namely, we will only use the 
fact that if a and b are relatively prime, then the congruence ax = c (mod b) 
has solutions (or, we can say, ax generates a complete system of residues 
modulo b whenever x does the same). Although a bit more complicated, we 
consider this proof to be instructive; hence, we present it here. 

We proceed by induction. The base case is clear, so we assume that we 
dispose of k consecutive natural numbers n + 1,...,2 + k, each of which is 
not square-free, and let P; be some prime square that divides the jth number, 
for 1 < j < k. Let also pzi; be a new prime, different from all pj, 1 < j < k. 
Then all numbers xp? -epz +n +j, 1 <j < k, are not square-free, for every 
integer x, and we can choose x in such a way that xpi epp +n = —k-1 
(mod p? ,;). One sees that xp? ---p? +n +j, 1 <j < k+l, arek + 1 non- 
square-free numbers, as we intended to show. 

Can this argument be adapted to obtain a proof of the Chinese remainder 
theorem? You probably already noted that the answer is definitely yes. Do we 
have another bridge? Yes, we do. 


. The answer is /6— v2. Let / be the length of the side of an arbitrary equilateral 


triangle ABC whose vertices are within the closed surface bounded by a unit 
square. There is always a vertex of the triangle such that the triangle is situated 
in the right angle determined by the parallels through that vertex to a pair 
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of perpendicular sides of the square. Say A is this vertex, and note that the 
projections of the sides AB and AC of the triangle on two sides sharing a 
common vertex of the square have lengths /cos@ and /cos(z/6 — a), where 
a is the angle that one of them forms with the side on which it is projected. 
Also, 0 < a < 2/6. The projections being included in the corresponding unit 
sides of the initial square, we obtain 


Icosa <1 and Icos (= -a) <l. 


cos G -a) 


cosa 


However, one of œ and 2/6 — œ is at most 2/12, thus either cosa > 
cos(2/12) = (v6 + V2)/4, or cos(/6 — œ) > cos(/12) = (V6 + V2)/4. 


In both cases, by using one of the above inequalities, we conclude that 


Meee iedes V2. 


The existence of an equilateral triangle inscribed in the unit square with 
precisely this length of its side is easy to prove. The triangle has a vertex in 
a vertex of the square, the other two vertices on two sides of the square, and its 
sides emerging from that vertex form angles of measure 2/12 with the sides of 
the square. 


1 Mathematical (and Other) Bridges 


1/12 


A 


By the way, one can state the similar problem in which only the word 
“square” is replaced by “cube.” If ABCDA’B’C’D’ is a unit cube (with two 
opposite faces ABCD and A’B’C'D’ and AA’, BB’, CC’, DD’ parallel edges), 
one can immediately see that triangle ACD’ (for instance) is equilateral with 
J/2 as length of its side; also, one can readily conclude that there is no bigger 
equilateral triangle inscribed in a unit cube. As intuitive as this result might be 
(more intuitive than the planar case, isn’t it?), one needs a demonstration for it. 
The reader is invited to see that if S is the area of a triangle and S1, S2, and S3 are 
the areas of the projections of the triangle on the faces of a rectangular trihedron 
(that is, on three planes perpendicular to each other), then S? = SẸ + S3 + s 
(an extension of the Pythagorean theorem, yet some kind of a bridge). In our 
case, if S is the area of an equilateral triangle situated within a unit a cube and 
we project the triangle on the planes of three faces of the cube that share a 
common vertex, we have $? = S + S + S3 (S1, S2, and S3 being the areas of 
the projections), and, moreover, each S; < 1/2 (as being the area of a triangle 
within a unit square). Thus, we obtain S$? < 3/4, and, consequently, the length 
of the side of the equilateral triangle is at most /2, finishing the proof. Isn’t 
this a mathematical bridge (from plane to solid geometry)? 


. We prove the first part by induction on m. For m = 1, the equality is the given 


one; thus, let us suppose that it is true for m and prove it for m + 1. To do this, 
it suffices to multiply to the right with A” the relation AB — BA = A and to 
multiply to the left with A the induction hypothesis A”B — BA” = mA", and 
then add side by side the two equalities thus obtained. 

The second part relies on the first. From the well-known fact that XY — YX 
always has zero trace (for X and Y square matrices of the same dimension) and 
mA” = A™B—BA"™, we conclude that the trace of A” is 0 for all positive integers 
m. Ifa, ... , an are the eigenvalues of A, this means that af +- +a” = 0 for all 
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m, and a canonical application of Newton’s formulae shows that the symmetric 
sums of aj,...,@, are all O, therefore that a; = ... = a, = 0. Now ann xn 
matrix with null eigenvalues has characteristic polynomial X”; therefore, by the 
Hamilton-Cayley theorem, A” = O,, that is, A is nilpotent. 

We can also give an alternative proof of the fact that A is nilpotent, using the 
equality A”B — BA” = mA”. Namely, if one defines the norm ||X|| of a matrix 
X = (xi)isij<n by 


IXI = max |x| 
l<ij<n 


(with |x| being the usual norm of the complex number x), then one immediately 
sees that |X + Y|| < |X] + IYI; lloeX|] = læl||X||, and ||XY|| < n|[X|]|]¥| for 
any n x n complex matrices X and Y and any complex number œ. Thus we have 


m||A™|| = ||A"B— BA™ || < ||A™B|| + ||BA™|| < 2n|/A™||||B| 


for every positive integer m. By choosing a large enough m, we see that this 
implies ||A”|| = 0 for that m, therefore A” = O,, finishing this variant of 
the proof. (Of course, A” = O, can be also deduced if one only knows that 
A” = O, for some m, even though the original problem didn’t actually ask for 
that.) 
A third approach for solving this problem is presented in Chapter 3. 

7. Ifa < b < care positive and a‘ + ck = 2b*, then, by Jensen’s inequality for 

the convex function x > x“, we have 


k k k 
r ere 


2 
hence b — a > c — b. (Jensen’s inequality can be avoided; we can write 
ct — b = b — a 
in the form 
(c= b) (č! + Fb ee) = 6-084 ba H a7’), 


then use a < b < c.) 

Suppose now that nk < nk < +++ is an infinite arithmetic progression with 
terms from the sequence of powers with exponent k. According to the above 
observation, we obtain n? — nı > n3 — m > ---, which would be an infinite 
strictly decreasing sequence of positive integers—an impossibility. 

There is a second approach (that will be useful for the next problem—which 
we cannot solve by other means). Namely, if nk, nk, ... is an infinite arithmetic 
progression, then there are a and b such that nt = aj + b for all j > 1. Then 


16 
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because the k-series (with k > 1) converges. On the other hand, the generalized 
Co 
1 
harmonic series 5 ——— diverges to oo, as it is again well-known (and 
j=l 
one gets easily, by comparing with the harmonic series). Thus, we have a 
contradiction and the problem is solved. 


. We can start with 17, 52,7? as such a three-term progression. Or, if we want 


to avoid the term 1, we can choose 287, 42”, 14°—a progression with common 


difference 5-14” = 980. If we have the arithmetic progression nh ower nés with 
common difference d, then, for k = [k,,..., ks] (the least common multiple of 
k,...,k,) and n = nés + d, the numbers 

nn} S nn", ntt! 


are s + 1 powers with exponent at least 2 forming an arithmetic progression 
with common difference n*d. Thus, inductively, we get such progressions with 

as many terms as we want. 
However, an infinite such progression does not exist, due to the same reason 
that we used in the second solution of the previous problem. Indeed, if we had 
J 


n’ = aj + b for all j > 1, then, on one hand, we would have 


1 1 
De gg 


jz1 "j jel 


and, on the other hand, 


1 1 1 
Dg ts DE ee 


Jars n>2 k>2 n>2 


We have 


dai +: +a, g 
n 


0 < (Gn41 — an) (Qn41 — Gn—1) +++ (an+1 — 41) < (arn = 
and the conclusion 
lim (an+ı = Gn) (Gn-+1 == Gn—1) ee (an+ı = ai) =0 
noo 


follows by the squeeze principle. In order to infer this, one has to know (apart 
from the arithmetic mean-geometric mean inequality) that if lim a, = a, then 
noo 
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lim (a; +---+a,)/n = a, too (an immediate consequence of the Stolz-Cesaro 
n—> 00 
theorem) and, of course, to know that 0° = 0 (that is, to know that if lim x, = 
noo 
Oand lim y, = oo, then lim æ” = 0). 
noo noo 

If (an)n>1 is only increasing it can also have limit infinity. For a, = n, we 

obtain 


bn = (Qn+1 — Gn) (Qn+1 = An—-1) s (GQn+1 cae ai) =R n!, 
which has the limit oo. For a, = n/2 for even n and a, = (n + 1)/2 for odd n, 
(bn)n>1 is divergent, having a subsequence with limit 0, and another with limit 


1 
oo. Finally, for an = 1 + 5 +--+ — (the harmonic series), we have 
n 


0<b,= l k ae : + a 
"n+ 1\nt+1 n n+1 2 


1 2 -A 1 


< yas = ; 
n+ln 2 n+1 


hence lim b, = 0. So, anything can happen if we drop the convergence 
n—>Co 


condition for the initial sequence. (This part was not in the original problem. 
But, when you deal with mathematics for so long, you learn to ask—and this 
is not bad at all. In this case, we found the last limit, which we did not meet 
before.) 

This statement is sometimes called Croft’s lemma. We prove it by contradiction, 
namely, if we assume that f does not have limit 0 at infinity, there exist a positive 
number € and a sequence (x,) of real numbers, with limit infinity such that 
If (%n)| > € for all n. Because f is continuous, one can find intervals 7, such that 
Xn € I, and |f(x)| > € for all x € J, and all n. Clearly, the intervals J, can be 
chosen as small as we want, and in particular, we can assume that their lengths 
are all smaller than a fixed positive number a. Further, we use the following: 


Lemma. Let 0 < c < dand let a > 0 be given. Then, for every sufficiently 
large a > 0, any interval (a,b) of length b— a < a can be covered by a 
“multiple” of (c,d). (A “multiple” of (c,d) is n(c, d) = {nx | x € (c, d)}.) 


Proof of the lemma. Indeed, let us choose a > c(d+qa)/(d—c) andb < a+qa. 
Then 


a a ut 
— — — > es 
c d d 


is a rephrasing of the inequality satisfied by a; therefore, we have 


a b a ata 
> p 
c -d è d 


> 1, 


11. 


1 Mathematical (and Other) Bridges 


yielding the existence of an integer n between b/d and a/c; because b/d < n < 
a/c, we have nc < a < b < nd; hence (a,b) C n(c, d). 
Now back to our problem. Denote J, = (an, bn), where, of course, lim a, = 
noo 


lim b, = oo. Start with an interval J; = [c,d] C (p,q) (with cı < dı) and 
noo 


positive integers mı and n; (that exist according to the lemma) such that nı Jı 
includes Zm. Then find a closed interval J2 = [c2, d2] (with c2 < d2) such that 
nıJ2 is a subset of J,,,. This is possible: all we have to do is to choose cz and dz 
in such a way that 


Am, bm, 
— <2 < dy < — $ Gy, < mC2 < Nd2 < Bmg 


ni ni 


Note that, since Im, C mJ; means 
NC, < Am, < bm < ndi, 


cı < C2 < dy < dı and, consequently, J2 C Jı follows. Then (again, by the 
lemma) we can find mz and n such that n2J> includes [,,,, and we can pick nz 
and m as large as we want; hence, we choose m) > m; and m > nı. Then we 
define a nondegenerate compact interval J3 = [c3, d3] such that noJ3 C Im C 
noJ2; we conclude that J3 C J2. 

In general, we can define two increasing sequences of positive integers (m,) 
and (ng) and a sequence of compact nested intervals (Jg) satisfying 


MI 41 C Im, C Ik 


for all k. Now, the nested intervals Ją must have (at least) a common point fo 
that belongs to (p,q), also, because J; was chosen inside (p,q). Since ngto € 
Jeti C Im, we conclude that |f (ngto)| > € for all k, which is a contradiction 
to the hypothesis that fim f (nto) = O—and finishes our proof. 


Solution I. The solutions are second degree polynomial functions of the form 
f(x) = Ax? + Bx (taking 0 value for x = 0). For x = mt, y = t, z = t, with 
m E€ Zandt € R, we get the recurrence relation 


Fin + DA — 2f((m + It) + f(mt) = f (21) — 2f0) 


satisfied by the sequence (f (mt)), for every real t. Solving this linear recurrence, 
we obtain 


f2) — FO 2 raed WIC). 


Fmt) => 5 


for all reals ¢ and all integers m. In particular, f(m) = Am? + Bm holds for all 
m € Z, with 
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_ YO- 
= À , 


A= 


LQ“ 30 mag 


Next we have, for a nonzero integer n, 


fl) = (r -) a (ie E Ue 
n 


and 


pies (o ) = FCD- FUD on 4 SOD -AOD cay, 
n 


some tedious algebra shows that solving for f(1/n) and f(2/n) yields 


ie RE a) 


with, of course, the same A and B as before. Now replacing these in the above 
relation for f (mt), with t = 1/n, gives 


SA 8) 


for all integers m and n # 0. Or we can say that f(r) = Ar? + Br for every 
rational number r. For finalizing, there is now a standard procedure (based on 
the continuity of f). For an arbitrary real x, there exists a sequence (r„)n>1 
of rational numbers such that hm. Tn = x. As we have f(r,) = Ar? + Br, 


for all n and f is continuous, we can pass to the limit for n — oo and get 
f(x) = lim f(r,) = lim (Ar. + Br,) = Ax’ + Bx. 
noo n—>oo 


Solution II. Let, for any real x and y, f(y) = f(x + y) —f (x) —f(); of course, 
fO) = f(x) for all x and y. We see that the functional equation can be written 
in the form f,(x + z) = f(x) +f), for all x, y, z € R. Because fy is continuous 
and satisfies Cauchy’s functional equation, it must be of the form f(x) = kyx 
for all x, with a fixed real constant k,. Of course, the constant depends on y, and 
we rather prefer to use the functional notation ky = g(y). Thus f(x) = g(y)x 
for all x, y € R. From the initial equation, we get (for x = y = z = 0) f(0) = 0, 
therefore 0 = —f(0) = fo(x) = g(0)x for all x implies g(0) = 0, too. For 
arbitrary nonzero x and y, the equality g(y)x = g(x)y can be also expressed 
as g(y)/y = g(x)/x, which means that x +> g(x)/x is a constant function; 
thus, there exists k € R such that g(x) = kx for all x 4 0. However, this 
formula works for x = 0, too, as long as we know g(0) = 0. Thus, we obtained 


fa +y) -f — fy) = g(v)x = g(x)y = kxy for all real numbers x and y. 
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Now consider the (also continuous) function A defined by h(x) = f(x) — 
kx? /2 for all x, and note that it also satisfies Cauchy’s functional equation: h(x-+ 
y) = h(x) + h(y) for all x,y € R, therefore there exists B € R such that 
h(x) = Bx for all x. With A = k/2, we thus conclude that f(x) = Ax? + Bx for 
all real x, as in the first solution. 

Of course, any of the solutions must end with the verification of the 
fact that the functions given by such a formula are, indeed, solutions of the 
problem, which is not at all complicated. A moment of attention shows that the 
verification relies on two identities, namely, 


(X+ytz2+xetytz=(K+y4+04+27+04+2) 
and 
Qatyt+ytrtyt+2=4+y? + e497? + 4+2 


that hold for all x,y,z € R. Knowing these identities is, of course, helpful for 
solving the problem (at least for guessing the solutions)—also, they could lead 
us to some generalizations of it. We are sure that the reader already recognized 
some particular (and simple) cases of the identity that appeared in the solution 
of problem 5 from the text (the Erdé6s-Ginzburg-Ziv theorem). Actually, we 
are sure that the reader will be able to extend the above and solve the following 
more general problem: prove that the only continuous real functions f satisfying 


Do. Urry (Z=) =0 


SC{1,...,m} ics 


for all x1,...,Xm E R are the polynomials of degree at most m — | taking 
value 0 for x = 0. Of course, this generalizes (besides our problem) Cauchy’s 
functional equation for continuous functions, but actually, everything is based 
on it (so that, in the end, we don’t get much of a generalization)—the result 
follows inductively, in the vein of the second solution. However, this statement 
is a nice converse of the mentioned identity (that can be useful in one proof of 
the Erd6s-Ginzburg-Ziv theorem). 

As said before, in mathematics (and not only in mathematics) there are 
bridges everywhere. 
This is a clever application of the mean value theorem, combined with the 
intermediate value theorem for continuous functions. The second allows us to 
prove that f has a fixed point (just apply the theorem to the continuous function 
g defined by g(x) = f(x) — x; since g(a) > 0 and g(b) < 0O, there surely 
exists c € [a, b] with g(c) = 0)—this is actually a particular case of Brouwer’s 
fixed point theorem. So, there is c € [a, b] such that f(c) = c. The conditions 
f(a) = b + a and f(b) = a F b show that, in fact, c is (strictly) between a 
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and b. Now apply the mean value theorem to f on the intervals [a, c] and [c, b]; 
accordingly, there exist cı € (a,c) and c2 € (c, b) such that 


_fO-f@ _ c-b 


c—a c—a 


f(c) 


and 


_fO-fO_ a-e 


f(c) — hae 


Of course, f’(ci)f’(c2) = 1 follows. 
13. With the change of variable t = 1/2 — x, we obtain 


m/2 1 0 1 
—~ d=- dt 
Í 1 + (tan x) v2 : ie 1 + (tan(2/2—1))v? 


m/2 1 m/2 1 
= a= Co 
Í 1+ 1/(tant)v? o 1+1/(tanx)¥? 


Be (tan xv? T mi? 1 
= dx= —— — dk, 
o I+ (tan x) V2 2 o l+ (tan x) 2 


whence 


m 1 x 
dx = —. 
o 1+ (tanx)v? 4 


Of course, there is no special significance of the exponent J/2 in this 
problem, and, of course, one has to note that the function under the integral is 
always defined and continuous in the entire interval [0, 2 /2]—although it seems 
not to be defined at 7/2 (or at 0, if the exponent in place of /2 is negative). 

The trick that we learned from it is that you can make the change of variable 


b 
t =a+b—x inan integral f f(x)dx and thus infer the equality 


/ : f(dx = J f(a+b-x)dx, 


which is often useful in the evaluation of definite integrals—when other 
approaches fail. For example, one can calculate with this trick 


/4 
I In(1 + tan x)dx (= (7/8) In 2), 
0 
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or 


f xln(1 + e*)dx (= 1/3), 


1 


or one can find the useful result that the integral of an odd integrable function 
on an interval symmetric with respect to the origin is 0. 
First one sees that the function f defined by 


fis) = (1+ “ye 


has derivative 


on [0, co). Thus, in the case 0 < a < b, we have, by the monotonicity of the 
Riemann integral, 


(b-a) (1 4 “ye < f (1 + 2) edx < (b—a) (1 a. “en 


a 


The conclusion follows in this first case by using the squeeze theorem and the 
well-known fact that lim (1 + t/n)” = æ for every real number t. 
n—>Co 


Consider now that a < b < 0. We have (by changing the variable with 
x= —t) 


a 


[0s Yean- f(t)! ea= f (1- =)" eax 


and one can see that the function g defined by 
g(x) = (1 — =) e 
n 


is again decreasing (although not on the whole interval [0, 0o)) because it has 
derivative 


which is < 0 on [—b,—a] for n > —a. So, for such n, we have similar 
inequalities to those above: 


(b-a) e. (1+ =)" dr < (b—a) (1-2) e, 


a 


yielding the conclusion in this case, too. 
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Finally, when a < 0 < b, we can split the integral as follows: 


b xn 0 X\h_ b XN’ 
/ (1+=)e ‘ax = f (1+=)e ‘a+ f (14 =) ear, 

a n a n 0 n 
and we get the result by applying the previous, already proved, cases; accord- 
ingly, the integral has limit O—a+ b—0 = b—a, finishing the proof. Of course, 
the result remains true for a > b, too. 

The last few problems are from the category of nostalgic bridges—they 
could mean nothing to other people, although they are deeply deposited in 
our minds and souls. However, the reader will recall his own problems of this 
kind, and he or she will definitely agree with us when we say that nostalgic 
bridges appear at every step we take in this world, during our (more or less 
mathematical) lives. 


Chapter 2 
Cardinality 


We say that two nonempty sets A, B are equivalent or of the same cardinality or of 
the same power if there is a bijection from A to B. We write this as A ~ B. 


Iff : A — B is a bijection, then we also denote A L B. Note that “~” is an 
equivalence relation. Indeed, we have 


(reflexivity) A ~ A. 
e —1 
(symmetry) if A Ae B, then pe A. 
(transitivity) if A re B and BX C, then A ad C: 
The equivalence class A= {B | A ~ B} is called the cardinality of A, denoted by 
|A| or card A. 
A nonempty set A is called finite if A ~ {1,2,...,n}, for some positive integer n. 


In this case, A has n elements and we put |A| = n. 
For finite sets A, B with |A| = |B| and function f : A —> B we have: 


f injective <= f bijective < f surjective. 


As a nice application, we give the following: 


Problem. Let p and q be primes, p # q. Then for all integers 0 < rı < p—1, 
0 < ro < q -— l, there exists an integer n which gives the remainders 7,72 when 
divided by p and q, respectively. 


Solution. Denote by 
Z = {0,1,....P—V}  Zq = {0,7,....g-1} , Zoa = (0.7,....pq—1\ 


the remainder (or residue) class sets relative to p, q,, respectively pg. Remember 
that, for any positive integer m, we can define (on the set Z of the integers) the 
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relation of congruence modulo m by a = b mod m if and only if a — b is divisible 
by m (or, equivalently, if a and b give equal remainders when divided by m). This 
is an equivalence relation on Z, and the remainder (or residue, or congruence) 
class modulo m of the integer x (that is, its equivalence class with respect to the 
congruence relation) is readily seen to be the set? = {...,x—2n,x—n,x,x+n,x+ 
2n,...}. The set Z/mZ = Zm of all the residue classes ‘modulo m m is then a ring with 
espect to addition and multiplication defined by @ + b=a+banda-b = a-b. 
The reader is invited to verify that these operations are well defined (they do not 
depend on choosing the representatives of the remainder classes) and that they 
indeed provide a ring structure for the set Z,,. Also note that Zm = 0.1, sheds m—1 p 

Now we go on further with the solution of the problem and define the function 
Q : Zp > Zp X Zq, by the law o) = (F,T) , T € Zpq. We have 


[Zol = |Z, x Z| = pq. 
The surjectivity of g follows if ọ is injective. Thus, we have the implications 


0) =P) > FD = F,r) 


so F = r. This means that ọ is injective and consequently surjective. There is 
n € {0,1,...,pq— 1} such that 


p(n) = (71,72) S (4,0) = F,R) > n=7, and N=, 


thus p | n— rı and q | n — m. Observe that this argument can be easily 
extended in order to obtain a proof of the very useful Chinese remainder theo- 
rem: if a,,d2,...,d, are pairwise relatively prime integers, then for any integers 
bi, b2,...,bn, the system x = b,(mod)),...,x = b,(mod,) has a unique 
solution modulo aiaz... an. O 


If A, B are finite and there is an injective map f : A —> B, then we put |A| < |B|. 
If moreover there is an injective map g : B — A, then |A| = |B|. This result, the 
Cantor-Bernstein theorem, is difficult when A and B are infinite. Here is a proof. 


Theorem (Cantor-Bernstein). If A,B are nonempty sets and there are injections 
f:A— B, g: B — A, then |A| = |B], i.e., there exists a bijection ġ : A > B. 


Proof. We say that b € B is an ancestor of a € A if 


(gofogo...ofog)(b)=a, 


2k+1 times 
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for some k. Similarly, a € A is an ancestor of b € B if 


(fogofo...cgof\(a) =b, 


2k+1 times 


for some k. Moreover, a’ € A is an ancestor of a € A if f(a’) € B is an ancestor of 
a and b' € Bis an ancestor of b € B if g(b’) € A is an ancestor of b. 
Denote by Mı, M2, Moo the set of all elements of A which have an odd, even, 
respectively an infinite number of ancestors. Define analogously N1, N2, Noo for B. 
We prove that the function ¢ : A —> B, given by 


ga), x€ Mı UM% 


$0) = [A eM 


is bijective. In this sense, we prove that its inverse is y : B > A, 


f'0),yEN 


kas o yEMUNs © 


These functions ¢, w are well defined because f, g are injective. 
Let x € A. If x € Mı U Ma, then 


pa) = g'a) € M U Noo, 
so 
VOA) = gO) = 880) = x. 
If x € M, then f(x) = f(x) € Nı and 
VOOD =T OA =F) =x. 


Hence y of = 1z. 
Let y € B. If y € N4, then 


v(y) =f '0) EM 
and 


DWO = fWO)) =F O =v. 


Ify € Nx U No, then 


YO) = 80) € Mı U Mæ 
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and 


ov) = 8 (WO) = 8 E0) = y. 


Hence ¢ o y = 14. In conclusion, ¢~! = y and consequently, |A| = |B]. O 


This theorem allows us to define an order relation by the law 
|A| < |B| if and only if there is f : A — B injective. 
As a direct consequence, we have 
|A| < |B| if and only if there is g : B —> A surjective. 


Using Zorn’s lemma, one can prove that this order relation is actually total. A set A 
is called countable if A is equivalent to the set N of nonnegative integers. A is called 
at most countable if it is finite or countable. 

A set is countable if and only if its elements can be written as a sequence. This 
does not happen for the set of the reals or any of its (nondegenerate) intervals (see 
problems 2 and 7 below). 

Nevertheless, a countable union of countable sets is also a countable set. Indeed, 
let A = |J,» An, where each A, is countable. Let An = {dn1,dn2,...} be an 
enumeration of An, for every natural number n > 1, and note that 


411, 412,421, 413, 422, 431,... 


is an enumeration of A (basically, the same argument shows that the set of positive 
rational numbers is countable, as we will immediately see). Obviously, the result 
remains true if every A, is at most countable. 

For instance, the set Z of all integers is countable because 


Z = {0,1,-1,2, —2,3,—-3,4, —4,...}. 
We can also note that 


Z =|] {=n n+ 1,-n+2,..3, 


neN 


which is a countable union of countable sets. 
For the set Q of rationals we have the decomposition 


123 4 
Q = U LLE, 
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so Q is countable. In another way, the set of positive rationals can be ordered as 
follows: 


1/1 > 1/2 1/3 > 1/4 


L 7 L 7 
2/1 2/2 2/3 2/4 
lZ L i we es 
3/1 3/2 3/33 3/4. 

L fi L Z ras 
4/1 4/2 4/3 4/4 
lZ L z L 


5/1 5/2 5/3 5/4 


Each element appears many times in the table, but we consider each of them only 
for the first time. 
The set N x N of pairs of nonnegative integers is countable. Indeed, 


f:N>NxN, f@=(,0) 
is injective and 
e:NxN>N, g(m,n =2”.3" 


is injective. According to the Cantor-Bernstein theorem, N x N ~ N. In addition, 
note that the map 


o: N*xN* >N* , g(mn) = 2"! . (2n— 1) 


is bijective. One can even find a polynomial bijection between N x N and N, which 
we leave as an interesting exercise for the reader. 


Proposed Problems 


1. Let A be an infinite set. Prove that for every positive integer n, A has a finite 
subset with n elements. Deduce that every infinite set has at least one countable 
subset. 

2. Prove that (0, 1) is not countable. Infer that R and R \ Q are not countable. 

3. Let X, A, B, be pairwise disjoint sets such that A, B are countable. Prove that 


XUAUBS\XUA. 


Deduce that for every countable set B of real numbers, R \ B ~ R. 
4. Prove that N x N x N is countable and soisN x Nx... N. 
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10. 


11. 


12. 
13. 


14. 


15. 


16. 


17. 


18. 


19. 
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. Leta < b be real numbers. Prove that (0, 1) ~ (a,b) ~ R. 
. Let A be a countable set and a € A. Prove that A \ {a} ~ A. Is this result true 


for every infinite set A? 


. Leta < b be real numbers. Prove that [a, b] ~ [a, b) ~ (a, b] ~ (a,b). 
. Prove that every ¢-discrete set of real numbers is at most countable. (A set 


A C Ris called e-discrete if |a — b| > £, for any different elements a, b of A). 


. Let S be a set of real numbers with the property that for all real numbers a < b, 


the set S N [a, b] is finite, possibly empty. Prove that S is at most countable. Is 
every set S with the above property an e-discrete set, for some positive real £? 
Let S be an infinite and uncountable set of real numbers. For each real number 
t, we put 


S“ (A =SN(-co, 4, St = SN ft, 00). 


Prove that there exists a real number tọ for which both sets S7 (to) and S* (to) 
are infinite and uncountable. 

A set M of positive real numbers has the property that the sum of any finite 
number of its elements is not greater than 7. Prove that the set M is at most 
countable. 

Prove that the set of polynomials with integer coefficients is countable. 

Prove that the set of algebraic numbers is countable. Deduce that the set of 
transcendental numbers is not countable. (A real number « is called an algebraic 
number if there exists a polynomial P Æ 0 with integer coefficients such that 
P(a) = 0. Otherwise, œ is called transcendental.) 

Prove that for each set X, we have |X| < |P(X)|. We denote by P(X) the power 
set of X (that is, the set of all subsets of X, including the empty set and X). 
However, the set of all finite subsets of N is countable (thus |N| = |P(N))|). 
Let pı, p2, ..., pg be distinct primes. Prove that for all integers ri, r2, ..., Tk 
there is an integer n such that n = r;(modp,), for all 1 <i < k. 

Prove that there are no functions f : R —> R with the property 


Fœ -folz 1, 


for all x,y € R, x Æ y. 
Prove that there are no functions f : R —> R with the property 


POT a 
for all x,y € R, x £ y. 

Prove that the discontinuity set of a monotone function f : R —> R is at most 
countable. 

Prove that the set of all permutations of the set of positive integers is 
uncountable. 
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20. 


21. 


22. 


23. 


Let f, g be two real functions such that f(x) < g(x) for all real numbers x. Prove 
that there exists an uncountable set A such that f(x) < g(y) for all x,y € A. 

Let R be the real line with the standard topology. Prove that every uncountable 
subset of R has uncountably many limit points. 

Find a function f : [0, 1] — [0, 1] such that for each nontrivial interval J € [0, 1] 
we have f (I) = [0, 1]. 

Let a and k be positive integers. Prove that for every positive integer d, there 
exists a positive integer n such that d divides ka” + n. 


Solutions 


l. 


First we will prove by induction the following proposition: 
P(n) : “The set A has a finite subset with n elements.” 


The set A is nonempty, so we can find an element a, € A. Then A; = {a} isa 
finite subset of A with one element, thus P(1) is true. 
Assume now that P(k) is true, so A has a finite subset with k elements, 


Ax = {a1,@2,..., ak} CA. 


The set A is infinite, while A; is finite, so the set A \ A, is nonempty. If we 
choose an element az+1ı € A \ Ax, then the set 


Ax = {a1, 2, .. . , Ak, 41} 


is a finite subset of A, with k + 1 elements. Hence P(k + 1) is true. 

Further, we prove that A has a countable subset. As we proved, for every 
positive integer n, we can find a finite subset A, C A with n elements. Then the 
set 


S=|JAn cA 


n>1 


is an infinite subset of A. Moreover, S is countable, as a countable union of finite 
sets. 


. Let us assume by contradiction that A = (0, 1) is countable, say 


A= {x,|neN, n21}. 


Let us consider the decimal representations of the elements of A, 
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X1 = 0.a11412413 as Al yess 
X2 = 0.d21d22023... don... 


For each integer k > 1, we choose a digit denoted b; so that 
be Fae , De AO, De FO. 
Now let us define the number 
x = 0.bibzb3 ... bn... . 


Obviously, x € (0, 1), so there is an integer m > 1 for which x = xm. But the 
equality x = Xm implies 


0.bib2b3 Lo bn .. | = 0.dm1Am24m3 sUn ea 


which is impossible, because the decimal of rank m are different, bm Æ amm. 
Now, let us assume by way of contradiction that R = {x, | n > 1} is countable. 
Obviously, there exists a subsequence (xp, )n>1 Of (%n),>, such that 


(0,1) = {xn |n > 15. 


This means that (0, 1) is countable, which we just showed to be false. 

Finally, we use the fact that the union of two countable sets is also a 
countable set. If R \ Q is countable, it should follow that R = Q U (R \ Q) is 
countable, as union of two countable sets. This contradiction shows that R \ Q 
is not countable. 

. The sets A, B are countable so we can assume that 


A = {a),d2,...,4n,...}, B = {bi,bo,...,bn,...}. 
Let us define the map 
f:XUAUB—>XUA 


given by the formula 


Xx, ifxe Xx 
fŒ) = jan, ifxeA,x=a,. 
an—1, if x E€ B, x = b, 
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First, the map f is well defined: it takes (all) values in X U A. Hence it is 
surjective. For injectivity, note that f is injective on each restriction to X, A, 
and B. Then f is injective on X UA UB, if we take into account that any two of 
the sets 


f(X) =x, f(A) = {az,44,..., don... .}, f(B) = {a1, a3, ... , Gan-1,---} 


are disjoint. 

For the second part of the problem, let A be a countable subset of R \ B. 
This choice is possible, because the set R \ B is nonempty and infinite. If X = 
R \ (A U B), then 


R\B=XUA, R=XUAUB, 


with A, B countable, and the conclusion follows, according to the first part of 
the problem. 
4. We begin by proving the implication 


A~BS>AxC~BxC, 
for all sets A, B, C. Indeed, if f : A — B is a bijection, then the map 
g:AxC>BxC 
given by 
(a,c) = (f(a),c) , acA, cEC, 


is also a bijection, so A x C ~ B x C. 
We have already proved that N ~ N x N. According to the above remark, 


NxN~NxNxN. 
Finally, by transitivity, 
N~NxN~NXNXN, 


soN~NxNxN. 
In a similar way, the sets N x N x ... x N are countable, too. As well, note 
that 


f:NoONXxNxN , f(r) = (n,0,0) 
is injective and 
g:NxNxNON ,  g(m,n,p) = 2”-3"-5? 


is injective. The conclusion follows by Cantor-Bernstein theorem. This method 
can also be used to prove that N x N x ... x N is countable. 
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5. One idea is to search a linear function f(x) = mx + n from (0,1) onto 


(a, b). In order to determine the values m,n, we impose the condition f(0) = 
a and f(1) = b. It gives 


n=a „)”n=a 
m+n=b m=b-—a ` 
Consequently, the function f : (0, 1) —> (a, b), given by 
f(x) =(b-a)x+a 
is bijective, so (0, 1) is equivalent to every interval (a, b). 
For the other part, it is sufficient to prove that R is equivalent to some open 


interval. Indeed, we can see that the function ọ : R > (-3, 5) , given by 
(x) = arctan x is bijective. 


. Assume that A = {a, | n € N}, so that aọ = a. 


Then the bijection f : A \ {a} — A given by the formula f(a,) = an—1, 
n € N,n > 1, shows us that A \ {a} ~ A. 


a, d2 a3 a4 as wee 


L ££ LL L£ x 


ao dı a2 a3 a4 as Brae 


The result remains true if A is an arbitrary infinite set. 
Indeed, let B = {x, | n € N} be a countable subset of A. We choose xo = a, 
then define the function ¢ : A \ {a} — A by the formula 


x, xEA\B 
Xn-1,X€B,x=X,,n>1- 


b(x) = 


In a classical way, we can easily prove that ¢ is bijective. Moreover, we can 
indicate its inverse | : A > A \ {a} with 


x, xEA\B 
Xn+1 XEB, eae ae ea Oe 


g(x) = 


. We have already proved that A \ {x} ~ A, for every infinite set A and x € A. In 


our case, 
[a, b] ~ [a, b] \ {b} <= [a,b] ~ [a, b), 


[a,b] ~ [a,b] \ ta} > la, b] ~ (a, b] 
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and further 
(a, b] ~ (a, b] \ {b} © (a, b] ~ (a,b). 
Finally, from transitivity, 


[a, b] ~ [a, b) ~ (a, b] ~ (a,b). 


. For each element x € A, consider the interval 


I ( : +) 
v= (x= 4,x+ >). 
r*a 


If x,y € A, x Æ y, then & N I = Ø. 
Indeed, if there is c in J, N Z, then 


E €& 
E< |x- y| S|x-—c]/ +|y-—c]<7+7=6, 
k-3l <- tly <5 45 
which is false. Now, for each x € A, we choose a rational number ry € L. 
As we have proved, x Æ y = rx Æ ry, which can be expressed that the map 
go : A > Q given by the law ġ (x) = ry, for all x € A, is injective. Finally, A is 
at most countable because Q is countable. 


. For every integer n, we put S, = SM [n,n + 1]. According to the hypothesis, 


all sets S,, n € Z are finite. Thus the set 


sas. 


nEZ 


is at most countable, as a countable union of finite sets. 
The answer to the question is negative. There exist sets S with the property 
from the hypothesis, which are not e-discrete. An example is 


S = {lnn |n e N*}. 


First we prove that there exists r such that the set S7 (r) is infinite and 
uncountable. If we assume the contrary, then the decomposition 


S=|Js-@, 


nEZ 


is a countable union of at most countable sets. Hence S is countable, a 
contradiction. Let 


a = inf {r | S~ (r) infinite and uncountable} 
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and similarly, we can define 
B = sup {r | S*(r) infinite and uncountable} : 


where cases œ = —oo or B = œ are accepted. 

We prove that œ < 6. If B < a, then let p < t < a. According to the 
definition of a, the set S~ (f) is countable, and from the definition of £, the set 
st (t) is countable. Hence S is countable, as union of two countable sets, 


S=S (USHA). 


Consequently, œ < B. Then for every a < tọ < ß, the sets S7 (tọ) and S* (to) 
are infinite and uncountable, because 


Sto) 2 Sw), S (to) 2 ST(B). 


For each integer n > 1, define the set 
1 
Anr=sxEM|x> -}. 
n 


Easily, M = U An. We will prove that every set A, is finite or empty, so M is 
n>1 

countable as a countable union of finite sets. Now we can prove that A, has at 

most 7n elements. If for some n, the set A, has at least 7n + 1 elements, say 

X1,X%2,..-Xmt1 E A, then 


1 1 
X>, N> wee ys Xni’. 
n n n 


By adding, 
+1 


Tn 
Xi HX Ft + X41 > 


which is a contradiction. 
For each polynomial P € Z[X], 


P = ao + aX +-+ aX”, 
define and denote by 
h(P) =n + |ao| + lai] +++ + an| 
the height of P, h(0) = 0. Let us put for each nonnegative integer k, 


P, = {P € ZIX] | h(P) =k. 
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14. 


Each set Px is finite, possibly empty, so 


zx = |_) 


keN 


is countable, as a countable union of finite sets. Indeed, there are only a finite 
number of polynomials with h(P) = k. First, note that if h(P) = k, then 
deg P < k and |ao|, |ai|,..., |an| < k. Consequently, P is defined by a finite 
number of integer coefficients ao, 41, . . . , an which are less than or equal to k in 
absolute value. 

The set A of algebraic numbers is the set of real roots of all nonconstant 
polynomials with integer coefficients. Using this remark, we can write 


A= U {reR|P@) =9}, 


PeZ [X] 


where 
Z [X] = Z[X] \ {0}. 


Consequently, A is countable as a countable union of finite sets. Indeed, each 
set from the union has at most k elements, where k = deg P. 

The function @ : X > P(X) given by $(x) = {x}, for all x € X, is injective, 
so |X| < |P(X)|. Thus we have to prove that there are no bijections from X 
onto P(X). If we assume by contradiction that there is a bijection f : X > 
P(X), then define the set A = {x € X |x €f(x)}, A € P(X). Because of the 
surjectivity of f, we have A = f (xo), for some x9 € X. Now the question is 


xy €A or xy <A? 


If xo € A, then xo ¢ f (xo), false because f(x) = A. If xo ¢ A, then xo € f(x), 
false, because f (xo) = A. 

The last two implications follow from the definition of the set A. These 
contradictions solve the problem. 

One way to see that the second statement of the problem is true is to consider 
the function that maps every finite subset {n;,..., ng} of N to the natural 
number 2”! + - -- + 2”* (and maps the empty set to 0). This mapping is clearly a 
bijection (since every positive integer has a unique binary representation), and 
the conclusion follows. 

Or, one can see that this is an enumeration of all finite sets of N: 


Ø, {0}, {1}, {0,1}, {2}, {0,2}, {1,2}, {0,1,2}, {3}... 


(we leave to the reader to decipher how the sets are enumerated; we think that 
he/she will do). 
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16. 


17. 
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Finally, one can see that the set P(N) of finite parts of N is the union of the 
sets P;, where P; means the set of finite subsets of N having the sum of their 
elements precisely i; since every P; is finite, P(N) is countable (as a countable 
union of finite sets). The reader will surely find a few more approaches. 
Denote p = pip2...px. Let us define the function 


fi Zp > Zp, X Zp, X ... X Zp, 
by the formula 
f(rmodp) = (rmodp;,rmodp2,...,rmodp;,), reé {0,1,...,p—l}. 


The sets Z, and Zp, X Zp, X ... X Zp, have the same number of elements. In 
fact, the problem asks to show that f is surjective. Under our hypothesis, it is 


sufficient to prove that f is injective. In this sense, let r,s € {0,1,...,p— 1} be 
such that 
(rmod pi, r mod p>, . . . , r mod px) = (s mod pı, s mod p>, . . . , s mod px). 


It follows that 
rmodp; = smodpı,...,r mod pg = s mod px, 


orpilr—s, polr—s, ... , pelr — s. Hence p|r — s, which is equivalent to 
rmodp = smodp. This proves the injectivity of f. 

For each integer k, there exists at most one element f(x) € [k, k + 1). Therefore 
to each real number x, we can assign a unique integer k = k(x) such that f(x) € 
[k,k + 1). Thus, the function 


R>əx— kp ez 


is injective. This is impossible, because R is not countable and Z is countable. 

Another method uses the injectivity of f. Indeed, if x + y, then 
If(x) —f(y)| = 1, so the equality f(x) = f(y) is not possible. The inequality 
from the hypothesis says that the image of the function f is a 1-discrete set, so 
it is at most countable. Now, the map f : R — Imf is injective, so 


card R < card Imf. 
This is impossible, because R is not countable. 


Let us assume, by way of contradiction, that such a function f does exist. Define 
g : (—1, 1) > R, given by g(x) = f(x), for all x € (—1, 1). Then 


1 
Ig) — go) = 7 
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18. 


19. 


20. 


for all x,y € (—1,1), x Æ y. Indeed, 


lg) — gQ)| = 


1 
ety 2 


Now define h : R —> R by h = 2-(go@), where ġ : R > (-1, 1) is 


2 
(x) = — arctan x. 
T 


Finally, the function h : R — R satisfies 


|A(x) — A(y)| > 1, 


for all x,y € R , x Æ y, which is impossible, as we have seen in the previous 
problem. 

Let D be the discontinuity set of f. It is well known that D contains only 
discontinuities of the first kind if f is monotone. We mean that for each x € D, 
there exist finite one-sided limits denoted 


fa) = uny 0) . aa) = eed O). 


If f is increasing, then f,(x) < fa(x), with strict inequality if x € D. Now, for 
every x € D, we choose a rational number denoted 


rx € fsx), fa) 
and we define the function 
D>x EN r, EQ. 
It is injective because of the implication 


x<y = fax) Sf). 


Finally, D is countable, because Q is countable. 
Take any semi-convergent series of real numbers with general term a, (for 


instance, an = cM") and apply Riemann’s theorem: for any real number a 
there exists a permutation z of the set of positive integers such that a,(1) + 
ar) +... = a. This gives an injection from the set of real numbers into the 


set of permutations of the positive integers. Since the former is uncountable, so 
is the desired set. 

Let us consider a rational number r(x) between f (x) and g(x) and look at the sets 
A, of those real numbers a such that r(a) = x. The union of these sets (taken 
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21. 


22. 
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over all rational numbers x) is the set of real numbers, which is uncountable. So, 
at least one of these sets is uncountable, and it clearly satisfies the conditions. 
Suppose that A is a subset of R that has a countable set of limit points. The 
points from A split into two classes: those that are limit points of A (denote by 
B this set) and those that are not (let C be this second subset of A). But any 
point c from C must have a neighborhood that does not intersect A (with the 
exception of c), and this neighborhood may be chosen to be an open interval 
that contains c and has rational points as extremities. Therefore C is countable 
(possibly finite). 

So, if we assume that the set of limit points of A is at most countable, B is 
also at most countable; then, since A = B U C, the countability of A follows, 
and this proves the problem’s claim by contraposition. 

Let us consider on [0, 1] the relation “~” defined by 


a~bsa-beQd 


Clearly, this is an equivalence relation; thus we can find a complete system of 
representatives of the equivalence classes, that is, a set A C [0, 1] such that any 
distinct a,b € A are not in the relation ~ and for each t € [0, 1], there is a 
(unique) a € A for which t ~ a. We have 


[0,1] = |X. 


acA 


if we denote by X, the equivalence class of a. Since 
Xa = {y € [0, lly —a E Q} = {a+ tlt € QN [-a, 1 — a]}, 


we see that each class X, is dense in [0,1] and is a countable set. If A was 
countable, then [0, 1] would be countable, too, as a countable union of countable 
sets. Since this is not the case, we infer that A is not countable, hence a bijection 
ọ : [0,1] > A can be found. 

Then we can define the desired function f : [0, 1] — [0, 1] by setting 


fœ =t, 


for all x € X, 4) and for suitably chosen ¢ € [0, 1]. Since each x € [0, 1] belongs 
to exactly one set X, and for each a € A there is a unique t € [0, 1] such that 
a = g(t), the function f is well defined. 

Now, let J C [0, 1] be an interval which is not reduced to one point. Z contains 
elements from any set X,, a € A (since the classes of equivalence are dense in 
[0, 1]). An arbitrary ż € [0,1] being given, the intersection of J with Xg) is 
nonempty; so we can consider an x € 7 N Xy) for which we have f(x) = t. 
Thus, we see that f takes in J any value t € [0, 1], that is, the desired result. 
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This is problem 1795, proposed by Jeff Groah in Mathematics Magazine, 

2/2008. Two more solutions can be found in the same Magazine, 2/2009. 
We prove the slightly more general statement that, for positive integers a, k, d, 
and N, there exist positive integers n; 0 < i < d — 1 such that n; > N and 
ka"! + n; = i (mod d) for all 0 < i < d — 1. Of course, n = nọ is the solution 
to our problem. 

The proof of the general result is by induction on d. For d = 1, we have 
nothing to prove (and even for d = 2, one can easily prove the statement). So, 
let’s assume it is true for all positive integers d < D and deduce it for D. Also, 
assume that some positive integer N has been fixed. 

We can consider the (smallest) period p of a modulo D; that is, p is the 
smallest positive integer such that a”*? = a” for allm > M, M being a 
certain nonnegative integer. We then also have a”t!? = a” (mod D) for all 
nonnegative integers m > M and I. Yet, note that p < D because the sequence 
of powers of a modulo D either contains 0 (and then p = 1), or it doesn’t (and 
then, surely, p is at most D—1). Consequently, d = (D, p) is also less than D and 
we can apply the induction hypothesis to infer that there exist positive integers 
m; such that m; > max{M, N} and ka” + m; = i (mod d) for all0 < i < d—1. 

We claim that the numbers ka" +t + (m; + sp), with 0 < i < d — 1, and 
0 < s < D/d — 1 are mutually distinct modulo D. Indeed, suppose that 


ka"! tP + (mi + sp) = kat”? + (m; + tp) (mod D), 


for i,j € {0,1,...,d — 1}, and s,t € {0, 1,..., D/d — 1}. Since p is a period of 
a modulo D and m; are (by choice) greater than M, we get ka"! + (m; + sp) = 
ka"i + (mj + tp) (mod D), and because d is a divisor of D, this congruence is 
also true modulo d. Again by the choice of the m;, ka"! + (m; + sp) = ka™ + 
(mj + tp) (mod d) becomes i + sp = j + tp (mod d), and then i = j (mod d) 
(as d is also a divisor of p). But i, j are from the set {0, 1,...,d — 1}, thus i = j. 
Going back to the initial congruence, we see that it becomes sp = tp (mod D), 
yielding s(p/d) = t(p/d) (mod D/d). But p/d and D/d are relatively prime, 
hence we get s = t (mod D/d) which, together with s,t € {0,1,...,D/d—1}, 
implies s = ¢ and the fact that the two original numbers are equal. 

Thus we have the d- (D/d) = D numbers kat? + (m; + sp), with 
0 <i <d-—1,and0 < s < D/d -— 1 that are mutually distinct modulo D; 
therefore they produce all possible remainders when divided by D (and here is 
our cardinality argument; remember problem 15, that is, roughly, the Chinese 
remainder theorem). This means that we can rename by np, 0 < h < D—1, the 
numbers m; + sp,0 < i < d,0 < s < D/d—1 in such a way that ka” + nj, = h 
(mod D) for each 0 < h < D — 1, and this is exactly what we wanted to prove 
for completing the induction. 

This is Problem 11789, proposed by Gregory Galperin and Yury J. Ionin 
in The American Mathematical Monthly. A different solution by Mark Wildon 
appeared in the same Monthly from August to September 2016. Note, however, 
that our proof is nothing but a rewording of the official solution of the seventh 
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shortlisted number theory problem from the 47th IMO, Slovenia, 2006. (The 
shortlisted problems can be found on the official site of the IMO.) That problem 
asks to show that, given a positive integer d, there exists a positive integer n such 
that 2” + n is divisible by d. 


Chapter 3 
Polynomial Functions Involving Determinants 


Some interesting properties of determinants can be established by defining 
polynomial functions of the type 


f(X) = det(A + XB), (3.1) 
where A, B are n x n matrices with complex entries. Under this hypothesis, f is a 
polynomial of degree < n. The coefficient of X” is equal to det B and the constant 
term of f is equal to det A. Indeed, the constant term of the polynomial f is equal to 


S(O) = det(A + 0 - B) = detA. 


The coefficient of X” can be calculated with the formula 


1 1 
X = lim — det(A + xB) = lim det € jy B) = det B. 
x—>o x! x—> 00 x 


In the case n = 2, the polynomial f is of second (or first) degree, so we can use 
the properties of quadratic functions, respectively linear functions. 

A special case of such polynomials is the characteristic polynomial y4(X) = 
det(X/,, — A). Its zeros are called the eigenvalues of A. Using the theory of linear 
systems, one shows that if A € M,,(C), then A € C is an eigenvalue of A if and only 
if there exists a nonzero vector v € C” such that Av = Av. One can give a more 
abstract interpretation of the above objects, using vector spaces. Namely, if V is a 
vector space over a field F, then any linear map f : V —> V can be represented by a 
matrix if the dimension of V is finite, say n. Indeed, once we fix a basis e1, €2,..., €n 
of V, f is determined by its values on e1, . . . , €n, thus by a matrix A such that f(e;) = 

n 


> ajie; for all j. Because for any other basis e/ , e}, . . . , e’, in which one can find an 
j=1 
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n 
invertible matrix P € M,,(F) such that e; = >» Pjiej, one can easily see that two 
j=1 
matrices A, B represent the same linear map f a two different bases of V if and only 
if there exists P € GL, (F), the set of invertible matrices in M,,(F), such that A = 
PBP~'. In this case, we say that A and B are similar. If f is a linear map represented 
by a matrix A, we then call the polynomial y(X) = X4(X) = det(X/,, — A) the 
characteristic polynomial of f . It is easy to see that yz does not depend on A, because 
two similar matrices have the same determinant. 

Before passing to some problems, let us point out some other useful facts. First, 
observe that if A, B € M,,(C) and A is invertible, then AB and BA are similar. Indeed, 
AB = A(BA)A™!. Thus y4s = za in this case. But this result holds without the 
assumption A € GL, (C). Indeed, letz € C and observe that there are infinitely many 
€ € C such that A + el, € GL,(C), because det(A + XI) is a nonzero polynomial. 
Actually, for the same reason, there is such a sequence e with im Ek = 0. Let 


Ay = A+ eéJ,; then the previous argument shows that 
det(z/, — A,B) = det(zl, — BAx). 
It is enough to make k — oo in the previous relation to deduce that 
det(z/, — AB) = det(z/,, — BA), 
and so yag = YXpa. The reader should try to understand this technique very 
well, since it can be used in many situations. Finally, take A € M,(C) and let 


41, A2,...,An be its eigenvalues, counted with multiplicities. By definition, 


det(XI, — A) = OHIO sa): 


On the other hand, 
X— a —daj2_ -... An 
— X— wa S 
det(XJ, — A) = a21 a22 a2n 
—dn1 —an2 .«.. X— Ann 


and a direct expansion shows that the coefficient of X"~! is 
(ay, + a22 +.. + ann) => —tr(A). 
Thus the trace of a matrix equals the sum of its eigenvalues, counted with 


multiplicities. Similarly, the determinant is the product of eigenvalues. Also, if 
€1,..., Ek are the kth roots of unity, we have 


45 


3 Polynomial Functions Involving Determinants 


k 


det(X*J, — A‘) = det | | [X — £;A) 
j=l 


k 


A x 
= | [dn — 6A) = [| (sr (D 


j= j=1 


k 
-JIT ie x)= TIJE- 2a) = 0*- 125). 


i=1 j=1 


n 


Thus det(Y/,, — A*) = [[@ — 2%), which means that the eigenvalues of A‘ are 


i=1 
the kth powers of the eigenvalues of A. Similarly, one can show that if g € C[X] 


then the eigenvalues of g(A) are g(A;) for j = 1,2,...,n. The reader is encouraged 


to prove this. 
Problem. Let A, B € M2.(R) such that AB = BA. Prove that 


det(A” + B’) > 0. 


Solution. Let us consider the quadratic function 


f(x) = det(A+xB) , xeR. 


with real coefficients. More precisely, let a, b,c € R be such that 


f(x) =ar+bxte. 


Then 
det(A? + B?) = det(A + iB) - det(A — iB) = f(i)f(—i) 


= (-a + bi + c)(—a — bi +c) = (c — a)? — (bi)? 


= (c-a? +b > 0. O 


Of course, the previous result holds for n x n matrices. Indeed, the fact that 
f@Òf(—i) is a nonnegative real number is not specific for polynomials of degree 
at most 2, but for any polynomial with real coefficients: all we need is to note that 


FOF-) = FOP. 
Recall the following basic property of the quadratic function f(x) = ax? +bx+c 


A 
X=y or x+y=-——]. 
a 


by eR, fe) =f6)] > | 
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In particular, if for some x, y € R, x Æ y, we have f(x) = f(y), then the abscissa of 
y 


: : >A 
the extremum point of the corresponding parabola is 


Problem. Let A, B € M2(R). Prove that 


2AB + 3BA 3AB + 2BA 
det (1 TA a) = det (r ¥ a) 


5 5 
Solution. Let us define the quadratic function (or linear, if det(BA — AB) = 0) 
F(x) = det(, + AB + x(BA — AB)). 
Because AB and BA have the same characteristic polynomial, 
det( + AB) = det(h + BA), 
so f(0) = f(1). But f is a quadratic function, thus 
fa) =fd-x), VxeR. 


In particular, 


and the conclusion is clear. Note that the same argument shows that 


‘AB BA AB BA 
det (n + Et) = det (1 F | 
x+y x+y 


for all real numbers x, y with x + y 40.0 


The following problem is also based on properties of the polynomial det(A + xB). 
However, the arguments are more involved. 


Problem. Let n be an odd positive integer and let A,B € M,,(IR) such that 
2AB = (BA) + I,. Prove that det(/, — AB) = 0. 


Solution. Consider the polynomial f(X) = det(X/,, — AB). We know that AB and 
BA have the same characteristic polynomial, therefore f(x) = det(x/,, — BA). Using 
the given relation, we conclude that 


2 2 
(HH) Z da (: = a? -aB) = det k > es v + (s4)?)) 


SS 
san 
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Because deg(f) is odd, f has at least one real root x. The above relation shows 


+ . 
that all terms of the sequence xı = x, X41) = — are roots of f. This sequence 


is increasing and takes only a finite number of distinct values, because f is not 0. 
Thus there exists n such that x, = x„+1. This implies x, = 1 and so 1 is a root of f, 
which means that det(/,, — AB) = 0. This finishes the proof. O 


The following problem was given in the Romanian Mathematical Olympiad 
in 1999: 


Problem. Let A be a 2 x 2 matrix with complex entries and let C(A) be the set 
of matrices commuting with A. Prove that | det(A + B)| > | det(B)| for all B € C(A) 
if and only if A? = Op. 


Solution. For one implication, let r1, r2 be the roots of the polynomial det(X/) — 
A). By taking the matrices B = —rjl we deduce that |r;|? < 0 and so r; = 0. Thus 
det(Xh — A) = X? and from Hamilton-Cayley’s relation, it follows that A? = Op. 
Conversely, suppose that A? = O2. We will prove more, namely, that det(A + B) = 
det(B) for all B € C(A). If det(B) 4 0, we can write det(—B*) = det(xA + B) - 
det(xA — B) for all complex numbers x. This shows that the polynomial det(XA + B) 
divides a nonzero constant polynomial, so it is constant. Thus det(A + B) = det(B). 
Now, suppose that det(B) = 0. There exists a sequence x, that converges to 0 and 
such that B + xah is invertible for all n. These matrices also belong to C(A) and so 
det(Xpl2 + A + B) = det(x„h + B). It is enough to make n — oo in order to deduce 
that det(A + B) = 0 = det(B). This finishes the proof. 0 


The following problem, from Gazeta Matematica’s Contest is a nice blending of 
linear algebra and analysis. 


Problem. Let A, B € M,(R) be matrices with the property that 
| det(A + zB)| < 1 
for any complex number z such that |z| = 1. Prove that 
(det(A))” + (det(B))? < 1. 


Solution. Consider the polynomial f(z) = det(A + zB). As we have seen, f can 
be written in the form det(A) + --- + det(B)z”. Now, we claim that if a polynomial 


g(X) = an X” +--+ + aX + ao 
has the property that |g(z)| < 1 for all |z| = 1, then 


laol? + Jay |? + +++ + lanl? < 1. 
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Indeed, a small computation, based on the fact that Je edt = O if n Æ 0, shows 
that 


1 2x . 
lao? + lai? +++ + lanl? = =| Ig(e")|?dt < 1. 
27 0 


Using this, we can immediately conclude that (det(A))? + (det(B))? < 1.0 


Here is an outstanding consequence of the fact that a nonzero polynomial has 
only finitely many zeros: 


Problem. Let A,B € M,(C) be matrices such that AB — BA = A. Prove that A 
is nilpotent, that is, there exists k > 1 such that Ak = O. 


Solution. Let us prove by induction that 
A‘B — BA* = kA* fork > 1. 


For k = 1, this is clear. Assuming that it is true for k, then A‘t!B — ABA‘ = kA‘T!, 
But AB = BA + A, hence A*t! B — BAT! = (k + 1)A**!, 

Now, consider the map ¢ : M,(C) > M,(C), 6(X) = XB — BX. We obtain 
(A*) = kA* for all k. Now, view ¢ as a linear map on a finite dimensional vector 
space M,(C). If A is not nilpotent, it follows that k is an eigenvalue for @ and 
this is true for all k. This is impossible since it would follow that the characteristic 
polynomial of ¢, which is nonzero, has infinitely many zeros, namely, all positive 
integers k. O 


Proposed Problems 


1. Let A,B € M2(R) be such that AB = BA and det(A? + B?) = 0. Prove that 
detA = det B. 

2. Let A,B € M;(R) be such that AB = BA,det(A? + B°) = 0, and 
det(A — B) = 0. Prove that det A = det B. 

3. Let U, V e M,(R). Prove that 


det(U + V) + det(U — V) = 2 det U + 2 det V. 
4. Let A,B € M2(R) be such that det(AB + BA) < 0. Prove that 
det(A? + B?) > 0. 
5. Let A,B € M>(Z) be two matrices, at least one of them being singular, i.e., of 


zero determinant. Prove that if AB = BA, then det(A? + B?) can be written as 
the sum of two cubes of integers. 
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6. 


10. 


11. 


12. 


13. 


14. 


Let A be a 2 x 2 matrix with rational entries with the property that 
det(A* — 2/5) = 0. 


Prove that A? = 2h and det A = —2. 


. Let A,B € M3(R) be such that det(AB — BA) = 0. Prove that 


‘AB BA AB BA 
det (1 + a) = det (i + ma , 
x+y x+y 


for all real numbers x, y with x + y Æ 0. 


. Let A,B € M2(R) be such that det(AB — BA) < 0. Prove that 


AB + BA 
det(h + AB) < det | h + —;z; 3 


. Let A, B, C e Ma (R) be pairwise commuting matrices so that 


det(AB + BC + CA) <0. 


Prove that det(A? + B? + C?) > 0. 
Let B be an n x n nilpotent matrix with real entries. Prove that 


det(J, + B) = 1. 


(Ann x n matrix B is called nilpotent if Bt = O,, for some positive integer k). 
Let A be an x n matrix with real entries so that A*t! = O,,, for a given positive 
integer k. Prove that 


det | 7, EJ La Ly =1 
et nay ay aaa a = dl. 


Let A € M,(C) and k > 1 be such that A‘ = O,. Prove that there exists 
B € M, (C) such that B? = I, — A. 

Let A be a 3 x 3 matrix with integer entries such that the sum of the elements 
from the main diagonal of the matrix A? is zero. Prove that we can find some 
elements of the matrix Af whose arithmetic mean is the square of an integer. 
Let A,B € M,(R) and suppose that there exists P € M,,(C) invertible such 
that B = P~'AP. Prove that there exists Q € M,(R) invertible such that 


B=0Q-'AQ. 
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Solutions 


1. Let us consider the polynomial f(X) = det(A + XB). We know that f can be 
represented as 


f(x) = det B- x? + mx + det A, 
where m € R. But det(A? + B?) = 0, so 


det(A + iB) det(A — iB) = 0 > f(i) = f(—i) = 0, 


hence f(x) = a(x? + 1), implying detA = det B (= @). 
2. The polynomial f(X) = det(A + XB) has degree < 3. From 


det(A? + B?) = det(A + iB) det(A — iB) = 0, 
we deduce f (i) = f(—i) = 0, so f is divisible by x? + 1. Hence 
f(x) = @ + Dime +n), 
for some real numbers m,n. Further, f(—1) = det(A — B) = 0, som = n and 
then f(x) = m(x? + 1)(x + 1). Now we can see that det A = det B (= m). 
3. The quadratic function f(x) = det(U + xV) can be written as 
f(x) = det V -x + mx + det U, 
for some m. Further, 
det(U + V) + det(U — V) = fC) + f(- 1) 
= (det V + m + det U) + (det V — m + det U) = 2(det U + det V). 
4. Let us define the quadratic function 
f(x) = det(A? + B? + x(AB + BA)). 
We have 
f(1) = det(A” + B? + AB + BA) = det(A + B} > 0 
and 


f(—1) = det(A? + B? — AB — BA) = det(A — B} > 0. 
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Now, note that if det(AB + BA) < 0, then f has a maximum. From f(—1) > 0 
and f(1) > 0, it follows that f(x) > 0, for all real numbers x € [—1, 1]. In 
particular, 


f(0) = 0 & det(A? + B’) > 0. 
In case det(AB+ BA) = 0, we also have f(0) > 0, because f is a linear function 
(possibly constant). 
We also can proceed using the identity 
det(U + V) + det(U — V) = 2detU + 2det V. 
By taking U = A? + B?, V = AB + BA, we obtain 
det(A? + B? + AB + BA) + det(A? + B? — AB — BA) 

= 2det(A” + B?) + 2 det(AB + BA) 

or 
det(A + B)? + det(A — B}? = 2 det(A? + B?) + 2 det(AB + BA). 

It follows that 

det(A? + B?) + det(AB + BA) > 0, 
which finishes the alternative solution. 

5. By symmetry, we may assume that det(B) = 0. Consider the linear function 
f(x) = det(A + xB). It can be also written as a + bx for some integers a, b. 
If z is a root of the equation z? + z + 1 = 0, then the fact that AB = BA 
implies the identity A> + B? = (A + B)(A + zB)(A + 2B). Thus, det(A? + 
B?) = fA) (Of (2) = (a + b) (a + bz)(a + bz?) = a + b’, which shows that 
det(A? + B?) is indeed the sum of two cubes. 

6. Let us define the quadratic function f(x) = det(A — xh), for all real x. The 
coefficient of x? is equal to det(—J,) = 1 and 


0 = det(A? — 2/5) = det(A — V2/,) det(A + V2h), 


so f(—V2)f(/2) = 0. Now, because f has rational coefficients, we conclude 
that f(./2) = f(—V2) = 0, so f(x) = x? — 2. Further, 


det(A — xh) = x* — 2, (3.2) 


for all real numbers x. For x = 0, we obtain detA = —2. The relation (3.2) is 
the characteristic equation of A, so A verifies it, A? — 2h = 0 > A* = 2h. 
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7. Let us consider the polynomial 


P(t) = det; + tAB + (1 —1)BA). 
We have 
P(t) = det(t(AB — BA) + h + BA). 


The coefficient of t? is det(AB — BA) = 0, so degP < 2. From the equality 
P(0) = P()), it follows that 


eye), 


for all real numbers A. The conclusion follows by taking 


Ere 
2 x+y 


. The axis of symmetry of the parabola y = f(x), where 


f(x) = det(x(AB — BA) + h + BA). 
1 
is x = 7 because of f (0) = f (1) and the coefficient of x” is det(AB — BA) < 0, 
1 1 
so f(x) <f (;) , for all x € R \ {1/2}. In particular, f(1) < f (5) which is 


AB+ BA 
det(J, + AB) < det (n + a) ; 


In case det(AB — BA) = 0, f is linear and f(0) = f(1), so f is constant. 
Consequently, 


AB+ BA 
det(h + AB) = det (1: + a.) ‘ 


2 


. Consider the polynomial function 


f(x) = det(A? + B? + C? + x(AB + BC + CA)). 
The coefficient of x? is equal to det(AB + BC + CA) < 0, thus the parabola 
y = f(x) is concave. We will prove that f(—1) > 0 and f(2) > 0, and then it 
will follow that f(0) > 0. Indeed, we have: 
f (2) = det(A? + B? + C? + 2AB + 2BC + 2CA) = det(A + B + C) > 0. 


On the other side, 
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f(—1) = det(A? + B? + C? — AB — BC — CA) 


1 
z det[(A— By + (B—C)’ + (C-—A)’]. 
Now, if X = A — Band Y = B—C, then C— A = —X — Y and further, 


1 
f(-l) = 7 det [X?+¥? + (X+ ¥)*] = det(X? + XY + Y’) 


2 
Ley 3 
=det| (X+-Y] + V3, = 0, 
2 2 
by the first problem of the theoretical part (note that X and Y commute). 
10. Let us define the polynomial function 
P(x) = det, —xB) , x €C. 


We will prove that P is constant and so P(—1) = P(0) = 1. If Bt = O,, then 
for all complex numbers x, we have 


I, = In — (ZB) = Un — BY In + B+ + BY). 


By passing to determinants, we deduce that P is a divisor of 1 in C[X], so it is 
indeed constant. 
11. Consider the polynomial function 


xk 
P(x) = det ( I, + ŽA #42 — At 
(x) = de ( + T + 7 peor m I 


The degree of P is not greater than kn and P satisfies the relation: 
Pœ)PO) = PŒ + y), 


for all real numbers x, y. Indeed, 


ko; ko; 
P(x)P(y) = det (x sa) - det 5 Za 


SN SA xy \ ap 
= det ae : 2 = det 5 Y= A 


! i!j! 
j=0 p=0 \i+j=p J 


k 
EYE p 
= det rae = P(x+y). 
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12. 


13. 
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In particular, P(x)P(—x) = P(0) = 1, so P is a constant polynomial, because it 
does not have complex roots. It follows that 


P(1) = PO) = 1. 


We can also proceed in a different way. According to the hypotheses, A is 
nilpotent. The sum of nilpotent matrices which commute with each other is 
a nilpotent matrix. In particular, the matrix 


Pa bp Lak 


is nilpotent. As we have proved in the previous problem, 


ANE E Ez 


We search for B in the form B = P(A) for some P € C[X]. First, let us prove 
by induction on k that we can find P € C[X] such that X*|P? (X) + X — 1. Take 
P, = 1 and assume that P; is found. Let P?(X)+X—1 = X*Q(X). If Q(0) = 0, 
take P41 = Px, so assume that Q(0) Æ 0. Let us search for 
Pui (X) = P(X) + aX“. 
Then 


Pig (X) +X — 1 = P(X) +X — 1 + 20X*P;,(X) + a? X* 
= X*(Q(X) + 2aP;(X) + a7X*), 
It is enough to choose œ such that Q(0) + 2aP;,(0) = 0, which is possible 
because P?(0) = 1 4 0. 
Now, let B = P(A). We know that 
P2(X) + X—1= Q(x) -x* 


for some Q € C[X]. Thus B? + A — I, = A‘Q(A) = 0 and so B? = I, — A. 
Let f(X) = det(Xh — A) = (X — A1)(X — A2)(X — A3) and observe that 


det(XI3 — A”) = (X — AT)(K — AZ)(K — 12). 
Thus the condition tr A? = 0 becomes Aj + AZ + AZ = 0, and this implies 


AS + AS + 1$ = 3471513. 


Solutions 55 


14. 


This can be translated as trA® = 3(detA)* and shows that the arithmetic 
mean of the elements on the main diagonal of Aĉ is a square, namely, (det A)’. 
Observe that we can write P = X + iY with X, Y e M, (R). Because PB = AP 
and A,B € M,,(R), we deduce that AX = XB and AY = YB. We are not 
done yet, because we cannot be sure that X or Y is invertible. Nevertheless, the 
polynomial function f(x) = det(xY +X) is not identically zero because f (i) 4 0 
(P being invertible). Therefore, there exists x € R such that f(x) # 0. Clearly, 
Q = X + xY answers the question. 


Chapter 4 
Some Applications of the Hamilton-Cayley 
Theorem 


We saw in a previous chapter that the characteristic polynomial of a matrix and its 
zeros, the eigenvalues of the matrix, can give precious information. The purpose of 
this chapter is to present a powerful theorem due to Hamilton and Cayley that gives 
an even stronger relation between a matrix and its characteristic polynomial: in a 
certain sense, the matrix is a “root” of its characteristic polynomial. After presenting 
a proof of this theorem, we investigate some interesting applications. 


Theorem (Hamilton-Cayley). Let K be any field and let A E M,(K) be a 
matrix. Define 


det(XI, — A) = X” + a1 X"! +--+» + aX + a 
the characteristic polynomial of A. Then 
A” + a14"! +++» + aA + aol, = On. 


The proof of this result is not easy and we will prove it here only for the field of 
complex numbers (which is already nontrivial!). Basically, if A is diagonal, the proof 
is immediate. So is the case when A is diagonalizable, because by a conjugation, 
one easily reduces the study to diagonal matrices. In order to solve the general 
case, we will use a density argument: first of all, we will prove that the set of 
diagonalizable matrices over C is dense in M,,(C). This is not difficult: because the 
characteristic polynomial of A has complex roots, there exists a complex invertible 
matrix P such that P~'AP is triangular. But then, it is enough to slightly modify the 
diagonal elements of this triangular matrix so that they become pairwise distinct. 
The new matrix is diagonalizable (because it is triangular and has pairwise distinct 
eigenvalues), and it is immediate that it can approximate the initial matrix arbitrarily 
well. Thus, the claim is proved. Now, consider A a complex matrix and take A, a 
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sequence of diagonalizable matrices that converge to A. Let Xa x +e ae 
be their characteristic polynomials. We know that 
bk) an= k 

At + dQ At! 4 aP ln = On, (*) 


by the argument in the beginning of the solution. On the other hand, the coefficients 
a® are polynomials in the coefficients of the matrix A; (simply because each 
coefficient of the characteristic polynomial is a polynomial in the coefficients of 
the matrix, as can be easily seen from the formula of the determinant). Because A, 
converges to A, it follows that each a” converges to ax, the coefficient of X* in the 
characteristic polynomial of A. But then, by taking the limit in the relation (*), we 
obtain exactly the desired equation satisfied by A. This finishes the proof. L 

Now, observe that one can easily compute a,_, in the above formula: all we have 


to do is to develop the determinant of X/, — A and take the coefficient of X”~!. Since 


det(X/, —A) = > e(o) j (Xio) = A1o(1)) tae (X8no(n) = Ano(n)) 


o€Sy, 
(here, by equals 1 if i = j and 0 otherwise, while e(o) = 1 if ø is even and —1 
otherwise), the only term which has a nonzero coefficient of X"~! is (X — a1,)(X — 
an2) . . . (X — ann), which gives the coefficient — (a11 + a22 +- - -+ ann). We recognize 


in the last quantity the opposite of the trace of A. Thus, the Hamilton-Cayley relation 
becomes 


A” —tr(A)- A"! + --- + (—1)" det(A)I, = On. 


In the very easy case n = 2 (for which the proof of the theorem itself is 
immediate by direct computation), we obtain the useful formula 


A? — tr(A)A + det(A)h = Op. 


Note already that the Hamilton-Cayley theorem comes handy when computing 
the powers of a matrix. Indeed, we know that A satisfies an equation of the form 


A” + nA"! + +++ + aA + aoln = On. 
By multiplying by A‘, we obtain 

AME ay jA"HET 4 06+ + ag Ak = On, 
which means that each entry of (A®)ps1 satisfies a linear recurrence with constant 
coefficients; thus we can immediately obtain formulas for the coefficients of the 


powers A*. The computations are not obvious, but in practice, for 2 x 2 and 3 x 3 
matrices, this gives a very quick way to compute the powers of a matrix. Let us see 


4 Some Applications of the Hamilton-Cayley Theorem 59 


in more detail what happens for 2 x 2 matrices. By the previous observation, to each 
matrix A € M2(C), we can inductively assign two sequences (an)„>1 , (bn)n>1 Of 
complex numbers for which A” = aA + byl, for all positive integers n. Let us find 
a recurrence satisfied by these sequences. If we assume that Ak = aA + bih, since 
we also have A? = œA + Bh, where œ = trA and B = — det A, we can write 


AT! = AŽ . A = (aA + bgh)A = aA? + kA 
= ar(aA + Bh) + bA = (aga + by)A + a Bh, 


so AKT! = ayy 1A + bei b with 


ak+1 = agar + by 
bya = af 


Here is an application of the above observation: 


Problem. Let A,B € M2(C) be two matrices such that A”B” = B"A” for some 
positive integer n. Prove that either AB = BA or one of the matrices A”, B” has the 
form aly for some complex number a. 


Solution. In the framework of the above results, this should be fairly easy. We 
saw that we can write A” = aA + bh and B” = cB + dh for some complex 
numbers a, b, c, d. The condition A”B” = B”A” becomes, after simple computations, 
ac(AB — BA) = O2. It is clear that this implies AB = BA or a = 0 orc = 0. But 
this is precisely what the problem was asking for. O 


The Hamilton-Cayley theorem is not only useful to compute powers of matrices 
but also to solve equations. Here are some examples. 


Problem. a) The solutions of the equation 
X =-h, Xe M,(C) (4.1) 


are X = ih, X = —il, and any matrix of the form 
c —a 


X= K a with a2 + be = —1. 


b) The solutions of the equation X? = h, X € M,(C) are X = h, X = —h and 
any matrix of the form 


c —a 


x= ae witha? + be = 1. 
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Solution. First of all, note that part b) follows immediately by using the next 
equivalence: 


X = -h © (iX =h. 
By considering the determinant, detX? = 1, we deduce that detX = 1. We 
therefore have two cases. In the first case, det X = 1, and with the notation œ = tr X, 
we have 
X —aX +h = 0. 
Combining this with equation X? = —h and taking into account that X is invertible, 


we obtain œ = 0. Now, with det X = 1 and tr X = 0, it follows that 


c —a 


x-(¢ at with a? + bc = —1. 


It is easy to check that all these matrices are solutions. In the second case, 
det X = —1. With œ = tr X, we have 


X —&X -h = 0. 


2 
Therefore, since X? = —h, we have X = ——h. By considering the trace, it follows 
a 


4 
that &« = —— > a = +2i. Finally, X = ih or X = —ih. O 
a 


With these preparations, we can solve the equation 
AB + BA = O. (4.2) 


when both matrices A and B are invertible. 


Problem. The solutions of the equation AB + BA = QO), with A,B € M.(C) 
invertible, are 


where x, y, z, a, b, c are any complex numbers with x? + yz 4 0, a? + bc Æ 0, and 
2ax + bz + cy = 0. 


Solution. We can assume, without loss of generality, that det(AB) = 1, by 
multiplying eventually by a scalar. 
We have 


tr(AB + BA) = 0 = tr (AB) + tr (BA) = 0, 
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and from tr (AB) = tr (BA), we obtain tr(AB) = 0. We also have detAB = 1 and 
from the Hamilton-Cayley relation, we obtain 


(AB) = -h (4.3) 
Now we can use the previous problem to solve the above equation. If X denotes any 
solution of the equation (4.1), then AB = X > B = A7!X, With B = A7!X, the 
equation (4.2) becomes 


A-A IX +A 1X-A =O. 


By multiplying by A to the left, we derive 


AX + XA = Op. (4.4) 
The cases X = ih and X = —ih are not acceptable, because A is invertible. 
Consequently, if 
Aa(*” xy=(4 b 
zt)’ c—a 
with x, y,z,t,a,b,c € C, @ + bc = —-1, xt- yz = 1, then the condition (4.4) 


becomes 


( >) ( = ) ( =a ) ( ] = 
zt c—a c —a zt 
In terms of linear systems, we obtain 


2ax + bz+cy=0 
b(t+x) =0 
c(t+x) =0 
—2at + bz + cy = 0. 


If t+ x 4 0, then b = c = 0. From the first and the last equations of the system, we 
deduce a = 0. This is impossible, because &@ + bc = —1. 

Thus, ¢ + x = 0. In this case, the first and the last equations of the system are 
equivalent. It follows that 


n ea) 
gx c—a 


for any x, y, z, a, b,c € C, satisfying 
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@+be=-1, xX +yz=-—l, 2ax+bz+cy=0. 


The general solution of the given equation is (¢A, uB), where ¢, u € C*. This 
includes also the case (iA, B), when det(iAB) = — det AB. O 


The following problem has a very short solution using the Hamilton-Cayley 
theorem, but it is far from obvious. 


Problem. Let A,B,C,D € M,C) such that AC is invertible and A‘B = C*D 
for all k > 1. Prove that B = D. 


Solution. Let 
f(x) = det(xl, — A) = x" +---+ (—1)” det A 
and 
g(x) = det(xl, — C) = x" + -+ + (—1)” det C 
be the respective characteristic polynomials of A and C, and let 


1 
h= aag 


Then A has the form h(x) = b,x?” +--+ + bix + 1, and by the Hamilton-Cayley 
theorem, we know that 


b ,A” +- + A+], = On 
and 
b C” +++ +bC +I, = On. 


It is enough to multiply the first relation by B and the second one by D and to take 
the difference. O 


Here is a beautiful problem proposed by Laurentiu Panaitopol for the Romanian 
National Mathematical Olympiad in 1994. 


Problem. For a given n > 2, find all 2 x 2 matrices with real entries such that 


Solution. By taking the determinant of both sides, we observe that we must have 
det X = 0 or det(X + ih) = 0. Assume for a moment that det(X + ih) = 0. Thus the 
characteristic polynomial of X, which has real coefficients, has —i as a root. Thus, 
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it also has i as a root and these are its roots. That is, the characteristic polynomial 
equals X? + 1, and by the Hamilton-Cayley theorem, we deduce that X? + h = Oo, 
which is clearly impossible. Thus det X¥ = 0 and if t = trX, we can write (again 
using Hamilton-Cayley) 


x" 4 x2 = ier + Pox. 


Therefore, if f(x) = x’ + x? — 2, we must have f(t) = 0. There are now two 
cases: either n is even, and in this case, —1 and 1 are the only zeros of f (since f 
is increasing on the set of positive numbers and decreasing on the set of negative 
numbers), and those two values t = | and £ = —1 give two solutions by the 
above relation. The second case is when n is odd, and in this case, the function f 
is increasing and has only one zero, at t = 1. This gives the corresponding solution 
X for odd n. O 


We continue with a very useful criterion for nilpotent matrices and some 
applications of it. 


Problem. Let A € M,,(C). Prove that A is nilpotent if and only if tr A* = 0 for 
allk > 1. 


Solution. If A is nilpotent, then any eigenvalue of A is zero. Indeed, if A is an 
eigenvalue of A, we know that A* is an eigenvalue of A“, and if A‘ = O,, this implies 
A* = 0 and so A = 0. On the other hand, we saw that tr A‘ is the sum of the kth 
powers of the eigenvalues of A and so it is 0. 

Now, suppose that trA* = 0 for k > 1 and let Ay, A2,...,A, be the eigenvalues 


1 
of A. Thus AX + AK +--+» + AK = 0 for all k > 1. Because 5 =1+z+ +... 


if |z| < 1, we deduce that if z is small enough, 


n 1 n Ce ae Co , n : 
> ai Hy > a= IX Se 
j=0 i=l 


i=1 i=1 j=0 


n 
Thus the rational function Sy — n is identically zero, because it has 


1 — za i 
infinitely many zeros. If all A; ae nonzero, this rational fraction has limit —n when 
|z| —> oo, which is absurd. Thus, we may assume that 4; = 0. But then AS + 
seep ak = 0 for k > 1, and the previous argument shows that one of À2,..., Àn is 
0. Continuing in this way, we deduce that all A; are 0. Hamilton-Cayley’s theorem 
shows then that A” = O, and so A is nilpotent. C 


We present now two beautiful applications of this criterion. 


Problem. Let A € M,,(C). Prove that A is nilpotent if and only if there is a 
sequence A, of matrices similar to A and having limit O,. 
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Solution. Suppose that A; are similar to A and tend to O,. Now take k > 1. Then 
Ay is similar to A*, thus 


trA* = lim trA‘ = tr ( lim A‘) = 0. 
jrow J j> J 


By the previous problem, A is nilpotent. 

Now, suppose that A is nilpotent. By performing a similarity transformation, we 
may assume that A is upper triangular, with O on the main diagonal (because A is 
nilpotent). It is easy to see that by performing conjugations with matrices of the 
form 


with c —> oo, we obtain a sequence of matrices that converges to O,. O 


Problem. Let § C M,(C) be a set of nilpotent matrices such that whenever 
A,B € S, we also have AB € S. Prove that for all A}, A2,..., Ap E€ S, we have 


(Ai + Az +--+ + Ax)” = On. 


Solution. Fix a positive integer r and let X = A; + A2 +--- + Ag. Observe that 
X” is a sum of products of the matrices A4, A2, . . . , A thus a sum of elements of S, 
which are nilpotent. By the criterion, it follows that tr X” = 0 and the same criterion 
implies that X is nilpotent. It is enough to apply Hamilton-Cayley to deduce that 
X” =0,.0 


Proposed Problems 


1. For a second order matrix A = (aj)1<;;<2, one can prove the Hamilton-Cayley 
theorem by computing directly 


A? — (ay + aa)A + (aian — A424) 


and obtaining the result O2. Find a proof of the Hamilton-Cayley theorem by 
direct computation in the case of third-order matrices. 

2. Let A be a second-order complex matrix such that A? =A,A Æ On, andA Æ h. 
For positive integers n, solve in M2(C) the equation X” = A. 

3. Let A € M2(C) be such that A” = O2, for some integer n > 2. Prove that 
A* = 0. 
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4. 


5: 


11. 
12. 
13. 
14. 


15. 
16. 


17. 


Let A,B € M>(C) be two matrices, not of the form Ah, with A € C. If AB = 
BA, prove that B = gA + Blo, for some complex numbers «, 6. 

Let A,B € M2(C) be two matrices. Assume that for some positive integers 
m,n, we have A”B” = B"A", and the matrices A” and B” are not of the form 
Ah, à € C. Prove that AB = BA. 


. Let M = {A € M(C) | det(zh — A) = 0 = |z| < 1} and let A,B € M with 


AB = BA. Prove that AB € M. 


. Letn > 2 and let Ao, Aj,...,An E M2(R) be nonzero matrices such that 


Ao Æ ah for all a € R and AoA; = AzAo for all k. Prove that 


det (> a) >0. 
k=1 


. Let A,B € M2(C) be non-invertible matrices and let n > 2 such that 


(AB)" + (BAY" = Op. 
Prove that (AB)? = O, and (BA)? = Op. 


. Let A € M3(C) be with trA = 0. Prove that tr(A*) = 3 det A. 
. Prove that for all X, Y, Z € M2(R), 


ZXYXY + ZYXYX + XYYXZ + YXXYZ 
= XYXYZ + YXYXZ + ZXYYX + ZYXXY. 


Is there a bijective function f : M2(C) > M3(C) such that f(XY) = f(X)f(Y) 
for all X, Y € M2(C)? 

Let A, B € M2(C) be such that tr(AB) = 0. Prove that (AB)? = (BA)?. 

Let A, B € M3(C) be such that (AB)? = A?B?. Prove that (AB — BA)* = O2. 
Let A, B € M,(C), with A + B non-invertible, such that AB + BA = O,. Prove 
that det(A? — B?) = 0. 

Let A,B € M,,(R) such that 3AB = 2BA + I. Prove that J, — AB is nilpotent. 
Let A, B, C e M,,(R), where n is not divisible by 3, such that 


A’ +B + C = AB + BC + CA. 
Prove that 
det [(AB — BA) + (BC — CB) + (CA — AC)] = 0. 
Let N be the n x n matrix with all its elements equal to 1/n, and let A = 


(ayj)1<ij<n € M,(R) be such that for some positive integer k, Ak = N. Prove 
that 


5 a; > 1. 


I<ij<n 
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18. Let A € M,(C), and let adj(A) denote the adjugate (the transpose of the 
cofactor matrix) of A. Show that if, for all k € N, k > 1, we have 


det((adj(A))* + In) = 1, 


then (adj(A))* = O,. 


Solutions 


1. Let A = (aj)1<ij<3 be a 3 x 3 matrix; we intend to prove that 
A? — s14? + A — 531; = 03, 
where sı = a11 + 422 + 433, 
a11 413 


a22 A23 411 412 


432 433 


S2 = 


’ 


431 433 a21 422 
and s3 = detA. The obvious problem is the calculation of A?, which seems to 
be (at least) very unpleasant. 

However, we can avoid this laborious calculation! Namely, assume first that 
A is nonsingular (hence invertible). In this case, the equality that we need to 
prove is equivalent to 


A? — 1A + 5213 — 53A7' = O3. 


Taking into account that 


di; —dz, d3ı 
s347! = det(A)A7! = adj(A) = | —dip dn —dz |, 
di3 —dy3 d33 


where dj is the cofactor of aj, for all i,j € {1,2,3} (thus s2 = dy, + d2 + 
d33), checking the equation by direct computation becomes a reasonable (and 
feasible) task. For instance, for the entry in the second row and first column, we 
need to verify that 


(214,11 + ay2421 + 423431) — (a11 + a22 + a33)a21 + diz = 0, 


which is true (remember that di2 = a21433 — a23431). In a similar manner, we 
verify that each entry of A? — 5A + 5/3 — 53A7! equals 0. 
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In order to finish the proof (which is, for the moment, done only for 
nonsingular matrices), we use a standard argument (that we have already seen 
before). Namely, suppose that A is a singular matrix, and consider, for each 
number x, the matrix A, = A — xh; Ax is nonsingular for every x in a (small 
enough) neighborhood of the origin (because det(A —x/3) = 0 for at most three 
distinct values of x). Thus the Hamilton-Cayley theorem works for every such 
A, (with x in that neighborhood). Letting x tend to 0, we get the validity of the 
theorem for matrix A as well. 

A proof of the Hamilton-Cayley theorem that generalizes the proof for 
second-order matrices to arbitrary n x n matrices can be found in the article “A 
Computational Proof of the Cayley-Hamilton Theorem” by Constantin-Nicolae 
Beli, in Gazeta Matematicd, seria A, 3-4/2014. 

2. The case n = 1 being clear, assume that n > 2. Clearly, A is not invertible, 
because otherwise, A = h. Thus (detX)” = detA = 0 and so detX = 0. 
Thus, if œ = trX, by Hamilton-Cayley, we have X? = aX and by induction, 
X” = a" |X, Clearly, a 4 0 because otherwise, A = O2. Thus X = al"A, 
that is, there exists z € C* with X = zA. The condition becomes z"A” = A. 
But A? = A implies A” = A and since A 0, we must have z” = 1. Thus the 
solutions are X; = er A withhO<k<n-1l. 

3. By considering the determinant, we obtain detA” = 0, so detA = 0. With 
a = trA, the Hamilton-Cayley relation becomes A? = œA, and inductively, 
A” = a"! . A, for all integers n > 2. Now, A” = O > a”! . A = Oo, so 
a = 0 or A = O3. In both cases, using also A? = gA, we obtain A? = Op. 


4. If we denote A = 6 A B= C z] , then from AB = BA, we derive 
c Z 


(with the convention that if a denominator is zero, then the corresponding 


. i . 0 
numerator is also zero). At least one of the fractions is not ~, because b, c 


cannot be both zero. Thus œ is well defined. It follows that 
y=ab, z=ac, t=ad+ 8, x=aa+f, 


with $ = t — ad. Finally, 


_[(xy\_ faa+B ab _ 
a= (77) =( ac wtp) TAA + Bh 


5. As we stated, we can define sequences (an)„>1 , (bn)n>1 > (Cn)n>1> (dn)n>1 Of 
complex numbers such that 


A‘ = aAt byh, BE = cB + dh, 
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for all positive integers k. From the hypothesis, am #4 0 and c, # 0. Then 
A™B" = B"A™ 
=> (amA + Dlr) (CnB + drh) = (CnB + dala) (amA + Binlr) 
= AnC,(AB — BA) = Op. 


Hence AB = BA, because amc, Æ 0. 


. First, assume that A = œh with |a| < 1. Let y1, y2 be the roots of the equation 


det(B — xh) = 0; then the roots of det(AB — xh) = 0 are ay; and ayo. Clearly 
læyı| = |a|-|yi| < 1 and |æy2| < 1. 

Secondly, assume that A # æh for all a € C. We saw in the previous 
problem 4 that there are a, b € C with B = aA + bh. 

Let x1, x2 be the roots of det(A — xJ)) = 0; then the roots of det(B — xh) = 0 
are yy = axı +b, y2 = axı +b and the roots of det(AB—xh) = O are zı = x1y1, 
Z2 = X2y2. Thus again |zi| = |x; | i ly;| <l. 


. By problem 4, there are œg, By € R with Ag = œzAo + Beh. If all a, = 0, then 


det (34 = ye > 0. 
k=1 k=1 


n 
Otherwise, let a = 2 az . Then 
k=1 


DA =gA? +2 (> ob) Ao + (> s) h = f (4o) 
k=l 


k=1 k=1 


where 
fx) = ax +2 (>: as) x+ XB. 
k=1 k=1 


By the Cauchy-Schwarz inequality, the discriminant of f is negative or zero. 
Thus we can write f(z) = a(z — zo) (z — Zo), and so 


Get (> a) = det f (A0) = @?°| det(Ao — zoh) |? > 0. 


k=1 


. With detAB = detBA = 0, and setting a = trAB = trBA, we have, by 


Hamilton-Cayley, 


(AB)? = w@AB, (BA)? = aBA. (4.1) 
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10. 


11. 


By induction, 
(AB)" = a""'AB, (BA)" = a" 'BA, 
for all positive integers n. The given equation (AB)” + (BA)” = O3 yields 
aœ”! (AB + BA) = Op. 


If a = 0, then the conclusion follows directly from (4.1). So suppose that 
AB + BA = O2. Then 


2 tr(AB) = tr(AB + BA) = 0, 


and again, œ = 0. In any case, (AB)? = O, and similarly for BA. 


. By the Hamilton-Cayley relation, we obtain the existence of œ € C with 


A>—trA-A? +a-A—detA- h = 03. 
By considering the trace, we deduce that 
tr(A*) — trA-tr(A?) +a - trA — det A - trh = 0 


and with trA = 0, trh = 3, the conclusion follows. 

Alternatively, let a, b, c be the eigenvalues of A. Then a + b + c = 0 is given 
and we have to prove that a? + b? + c? = 3abc, which is immediate. 
Observe that the identity can be written in a more appropriate form 


Z(XY — YX)? = (XY — YXY’Z. 


This is clear by the Hamilton-Cayley theorem, because tr(XY — YX) = 0, so 
(XY — YX)? = ah for some a. 
The answer is negative. Suppose that f is such a function and consider 


010 
A=]{]001 
000 


One can easily check that A? = O; and A? Æ O3. Let B € M2(C), B = f~! (A). 
Then (f(B))? = O3, thus f(B*) = O3 and so B? = Oz, because f(O2) = O3 
(indeed, let Xọ = f~! (03); then f(Xo) = O3, and so f(2Xo) = f(2h)f(Xo) = 
O3 = f (Xo), thus 2X9 = Xo and Xp = O2). But previous exercise 3 implies that 
B? = Oo, thus O3 = f(O2) = f(B’) = (f(B))* = A’, a contradiction. 
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12. 


13. 
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By the Hamilton-Cayley relation for the matrices AB and BA, we have 
(AB)? — tr(AB) - AB + det(AB) - h = O2 
and 
(BA)? — tr(BA) - BA + det(BA) - h = Op. 
By a well-known property of the trace, we have tr(AB) = tr(BA) = 0, so 
(AB)? = —det(AB)-I,, (BA) = —det(BA)- h. 


But det(AB) = det(BA), hence (AB)? = (BA). 
Using the Hamilton-Cayley relation for the matrix AB — BA, we have 


(AB — BA)” — tr(AB — BA) - (AB — BA) + det(AB — BA) - h = O2 
or 
(AB — BA)? = Ah, (4.1) 
with A = — det(AB — BA) because tr(AB — BA) = 0. From (4.1), it follows that 
tr(AB — BA)? = 22. (4.2) 
Now the conclusion of the problem follows from (4.1) if we can show that 
A = 0 or equivalently, according to (4.2), tr(AB — BA)? = 0. In this sense, note 
that 
(AB — BA)” = (AB)* — ABBA — BAAB + (BAY. (4.3) 


By the hypothesis, tr (AB)? = tr(A?B’). Using tr(UV) = tr(VU), it follows that 
tr(AB)* = tr(BA)? and 


tr(ABB - A) = tr(B- AAB) = tr(A7B’). 
Finally, from (4.3), 


tr(AB — BA)? = tr (ABY — tr(A?B?) — tr(A7B’) + tr(AB)* 
= 2tr[(AB)* — AB] = 0. 


14. Letx € C”, x Æ 0 be such that 


(A + B)x = Op. 
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15. 


16. 


It follows that Ax = —Bx, so, using yet the identity AB = —BA, we have 
A?x = —A?Bx = ABAx = —BA*x = Bx. 


Hence (A? — B?)x = O, for some nonzero vector x € C”, so det(A? — B?) = 0. 
A direct proof is provided by the equality 


£- B 


II 


(A + B)(A? — AB — B°’). 
(Prove it by using AB + BA 


follows. 
Let f(x) = det(x/,, — AB) be the characteristic polynomial of AB (and BA, too). 


Then 
2BA + I, 2r 3x— 1 
fŒ) = det | xi, — 3 =a det 5 I,—BA 


an Si an =i 
oo deel ews | a 
3n 2 z \ 9 


Thus, if z is a root of f, so are the terms of the sequence 


II 


O,!) From det(A + B) = 0, the conclusion 


3x, — 1 
X1 = Z, Xn+1 = 7 . 


Because f has a finite number of roots, the previous sequence has finitely many 
terms, which easily implies x; = 1. Thus all roots of f are equal to 1 and so 
f(x) = («— 1)". By Hamilton-Cayley, (AB — In)” = On. 

Let us consider the complex number 


ase 
3 3 
and define the matrix 
X=A+eB+&C. 
We have 
X=A+EB+PC=A+E°B+ EC. 
Then the number 


det(X - X) = det X - detX = det X - det X = |det X|? > 0 
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is real and nonnegative. On the other hand, by hypothesis and by e7+e+1 = 0, 
we get 


X-X = (A + £B + &C)(A+&B+ £C) 
= (e? + 1)(AB+ BC + CA) + «(BA + CB + AC) 
= £ |(AB — BA) + (BC — CB) + (CA—AC)]. 
Now, because det(X - X) € R, we infer that 
e” det [(AB — BA) + (BC — CB) + (CA—AC)] € R, 


so det [(AB — BA) + (BC — CB) + (CA—AC)] = 0. 
Solution I. First note that the equality AX = N implies AN = NA and this 
shows that the sums of entries in each row and column of A are all equal. Then, 
if a is the common value of these sums, an easy induction shows that A* (for 
positive integer s) has the same property as A, that is, all sums of entries in each 
row and column are equal to the same number, which is a’. In particular, since 
A= N, a‘ = 1 follows; thus, being real, a can be either 1 or —1. 

Next, one immediately sees that ye ai, represents the sum of entries from 

I<ij<n 
the principal diagonal of AA’, A‘ being the transpose of A; that is, J ai, is the 
I<ij<n 

trace of AA‘, which also equals the sum of the eigenvalues of AA’. But, clearly, 
AA’ is asymmetric real matrix, implying that all its eigenvalues are nonnegative 
real numbers. 

Finally, for the vector u with all entries 1, we have Au = u and A'u = u, or 
Au = —u and Atu = —u (remember that the sums of entries in each row and in 
each column of A are either all equal to 1 or are all equal to —1). In both cases, 
AA'u = u follows, and this shows that 1 is one of the eigenvalues of AA‘ (with 
corresponding eigenvector u). 

Let us summarize: AA’ has all eigenvalues nonnegative, and one of them is 
1. Of course, this being the case, their sum (i.e., the trace of AA’) is at least 1, 
and thus we can finish our proof: 


a; = tr(AA’) > 1. 


l<ij<n 


This is problem 341 proposed by Lucian Turea in Gazeta Matematicd, seria 
A, 3-4/2011, and here follows his solution. 
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18. 


Solution II. For any real n x n matrix X = (xj) 1<ij<n, define 


m(X) = »X xi 


l<ij<n 


thus m(X) is the sum of the squares of all entries of X. One can prove that 
m(XY) < m(X)m(Y) for all X,Y € M,(R), with the obvious consequence 
m(X*) < (m(X))* for every real matrix X and every positive integer s. In our 
case, 


1 = m(N) = m(A*) < (m(A)) 


follows, that is, m(A) > 1—which is precisely what we wanted to prove. 


For the sake of completeness, let us prove that m(XY) < m(X)m(Y) for all 
real n X n matrices X = (Xi) 1<ij<n and Y = (Vif) 1<i,j<n- Let XY = (Zi) 1<ij<n- 


Thus 
n 2 n n 
k=1 k=1 k=1 


by the Cauchy-Schwarz inequality. Summing upon j, we have 


has (Ea) (E2) Eao 
j= = = 


j=l k=1 


and, if we sum up again, this time over i, we get exactly the desired result 
m(XY) < m(X)m(Y). 

Of course, this solution shows that in place of N, we can consider any 
real matrix such that the sum of the squares of its entries is (at least) 1, and 
the statement of the problem remains true. On the other hand, from the first 
solution, we see that for any real matrix A such that AA‘ has an eigenvalue 
equal to 1, the sum of the squares of the entries of A is at least 1. We used 
the equality A‘ = N only to infer that AA’ has such an eigenvalue. Thus each 
solution provides a kind of generalization of the original problem statement. 
Let P(x) = det(xI, — A) = x" + Cn-1X"7! + +++ + co be the characteristic 
polynomial of the matrix A, and let A;,...,A, be the eigenvalues of A (i.e., the 
zeros of P). The following three facts are well-known and we will use them 
soon (prove the first and the second!). 


1) The eigenvalues of adj(A) are A2---An, ...,A1 +++ An—1 (that is, the products 
of n — 1 of the eigenvalues of A). 

2) adj(A) = (=I)! (A! + c_-A™? +--+ + cil). 

3) Hamilton-Cayley theorem: A” + Cn-14A"7! +- + cA + Coln = On. 
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Now we proceed to solve the problem. Let uy) = Ag---An, -.., Un = 
A, -+++An—1 be the eigenvalues of the adjugate of A. For any positive integer 
k, the matrix (adj(A))* + I, has the eigenvalues pe +1,...,u* + 1 (also 
well-known), and the determinant of any such matrix is the product of its 
eigenvalues. Thus the condition given in the hypothesis can be written in the 
form 


(uit De -k+1I)=1 


for all natural numbers k > 1. After expanding and using Newton’s formulas, 
one sees that this happens if and only if 4; = --- = Un = 0. This means that 


cı = (1) "(1 ++ + Mn) = 0, 

and at least one of A;,...,A, is zero, hence 
co = (-1)" det(A) = (-1)"A,---A, = 0. 

Now, the Hamilton-Cayley theorem yields 

A” + jjA ++ A? =O, 
and from the second fact mentioned in the beginning, we get 

(adj(A))* = (A! + eA" + + AY? = 
= (A" + cA"! $e + AP (AT? + cai A"? + ++ + Cot) = On, 


because the first factor is O,. The solution ends here. 

Note that the core of the problem is the fact that if the adjugate of a matrix is 
nilpotent (which we obtained when we found that all its eigenvalues are 0), then 
actually the square of the adjugate is the zero matrix. This can be also proved as 
follows. Because adj(A) is nilpotent, there exists a positive integer s such that 
(adj(A))* = O,, and the determinant of adj(A) is zero, which means that A is 
singular too (because Aadj(A) = det(A)J/,). If the rank of A is less than n — 1, 
then adj(A) is the zero matrix and there is nothing else to prove. Otherwise, 
Sylvester’s inequality shows that adj(A) has rank at most 1, and, in fact, the 
rank is 1, because adj(A) is not the zero matrix—look for problem 3 in the 
Chapter 5. Thus (see problem 4 in the same chapter), there exist B € M,,;(C) 
and C € M,,,(C) such that adj(A) = BC. In this case, we have CB = t € C. 

The equality (adj(A))* = O, can now be written as (BC) = On, or 
B(CB)*'C = O,, or t®adj(A) = On, and it implies t = 0 (as the adjugate is 
not the zero matrix, and only when s > 2—actually when s > 3—there is really 
something to prove). Thus CB = 0, and (adj(A))” = (BC)? = B(CB)C = O,. 

This is problem 360, proposed by Marius Cavachi and Cezar Lupu, in Gazeta 
Matematică, seria A, 1-2/2012 (and solved in 1-2/2013 by the second method). 


Chapter 5 
A Decomposition Theorem Related to the Rank 
of a Matrix 


Here, for sake of simplicity, we often assume that the matrices we are dealing with 
are square matrices. Indeed, an arbitrary matrix can be transformed into a square 
matrix by attaching zero rows (columns), without changing its rank. Let us consider 
for the beginning the following operations on a square matrix, which does not 
change its rank: 


1. permutation of two rows (columns). 
2. multiplication of a row (column) with a nonzero real number. 
3. addition of a row (column) multiplied by a real number to another row (column). 


We will call these operations elementary operations. We set the following 
problem: are these elementary operations of algebraic type? For example, we ask 
if the permutation of the rows (columns) i and j of an arbitrary matrix A is in 
fact the result of multiplication to the left (right) of the matrix A with a special 
matrix denoted Uj. If such a matrix Uj exists, then it should have the same effect 
on the identity /,. Hence the matrix U;,J,, is obtained from the identity matrix by 
permutating the rows i and j. But Uj, = Uj, so 
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Thus Uj is the matrix obtained from the identity matrix by interchanging its ith and 
jth rows (all the missing entries in the above expression of Uj; are equal to 0). Now 
it can be easily seen that the matrix UA, respectively AU; is the matrix A with 
the rows, respectively the columns i and j permuted. The matrix Uj is invertible, 
because U; = I,. Moreover, det U; = —1, because the permutation of two rows 
(columns) changes the sign of the determinant. In an analogous way, we now search 
for a matrix V;(œ) for which the multiplication with an arbitrary matrix A leads to 
the multiplication of the ith row (column) of A by the nonzero real œ. In particular, 
V;(a)J, will be the identity matrix having the ith row multiplied by a. But V;(a@)/, = 
V;(a), so V;(a@) must be the matrix obtained by performing one single change to the 
identity matrix; namely, its element 1 from the ith row and ith column is replaced 
by a: 


Vi(a) = OP See oh Mot wi BS 


Now it can be easily seen that the matrix V;(~)A, respectively AV;(q~) is the matrix 
A having the ith row, respectively the ith column multiplied by a. Obviously, 
det V;(~) = a 4 0, so the matrix V;(@) is invertible. Similarly, let us remark that if, 
for i Æ j, we add the jth row of the identity matrix multiplied by A to the ith row, we 
obtain the matrix (again, all missing entries are zeros): 
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WA) = 1 


Now, we can easily see that the matrix Wj(A)A, respectively AW;(A) is obtained 
from the matrix A by adding the jth row multiplied by to the ith row, respectively 
by adding the ith column multiplied by to the jth column. 

All the matrices Uj, Vi(a), Wi(A), œ € R*, A € R are invertible, and we will 
call them elementary matrices. Now we can give the following basic result: 


Theorem 1. Each matrix A E€ M,(C) can be represented in the form 
A = PQR, 


1,0 


where P, R € M,C) are invertible and Q = ( 00 


) € M,(C), with r = rank(A). 


To prove this, let us first note that every matrix A can be transformed into a 
matrix Q by applying the elementary operations 1-3. If, for example, a,, 4 0, then 
multiply the first column by Ge to obtain 1 on the position (1, 1). Then add the first 
line multiplied by —a;, to the ith row, i > 2 to obtain zeros in the other places of 
the first column. Similarly, we can obtain zeros in the other places of the first row. 
Finally, a matrix Q is obtained, and in algebraic formulation, we can write 


S,...SpAT, ... Tq =Q, 
where S;, Tj}, 1 <i < p, 1 <j < q are elementary matrices. Hence 
A = (Si -Sp OTi... T)! 


and we can take 


P= (S1...S), R= (T1... T). O 
The rank is invariant under elementary operations, so 


rank(A) = rank(Q) = r. 
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We can see that for every matrix X, the matrix QX, respectively XQ is the matrix X 
having all elements of the last n — r rows, respectively the last n — r columns equal 
to zero. Theorem | is equivalent to the following: 


Proposition. Let A,B € M,(C). Then rank(A) = rank(B) if and only if there 
exist invertible matrices X, Y € M,(C) such that A = XBY. 


If rank(A) = r, then rank(A) = rank(Q), and according to theorem 1, there exist 
X,Y invertible such that A = XQY. By multiplication by an invertible matrix X, 
the rank remains unchanged. Indeed, this follows from the proposition and from the 
relations 


XB=XBI,, BX =1,BX. O 


As a direct consequence, we give a proof of the famous Sylvester’s inequality: 


Proposition. Zf A,B € M,(C), then 
rank(AB) > rank(A) + rank(B) — n. 
Let rı = rank(A), r2 = rank(B) and consider the decompositions 
A=P\QiRi, B= P2Q.R., 
with P;, R; invertible, rank(Q;) = r;, i = 1,2. Then 
AB = P; (Qi Ri P2Q>) Ro, 
so 
rank(AB) = rank (Qı Rı P202). 


The matrix Q)RjP2Q> is obtained from the (invertible) matrix Rj P2 by replacing the 
last n — rı rows and last n — r2 columns with zeros. Consequently, 


rank(AB) > n— (n—ri)— (n= r) =r +rn-n. O 
We continue with some applications of the above results. The first one was a 
problem proposed in 2002 at the Romanian National Mathematical Olympiad: 


Problem. Let A € M,(C) be a matrix such that rank(A) = rank(A*) = r for 
some 1 < r < n — 1. Prove that rank(A*) = r for all positive integers k. 


Solution. Let us consider the decomposition A = PQR, as in theorem 1. 


00 
D e M,(C). The hypothesis implies that D is invertible. Now, observe that 


Then A? = P(QRPQ)R, and we can easily see that QRPQ = & for some 
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A? = PXOR, where X is the invertible matrix with diagonal blocks D and [,_,. 
Thus 


A? = PXORPOR = PXXOR = PX’ QR. 


An immediate induction shows that A” = PX"—'QR, and thus rank(A*) = r, 
because X, P, and R are invertible. O 


The next example is a very classical result, due to Moore-Penrose about the 
generalized inverse of a matrix. It appeared, however, as a problem in the Romanian 
National Mathematical Olympiad in 2005: 


Problem. Prove that for any matrix A € M,C), there exists a matrix B such 
that ABA = A and BAB = B. Prove that if A is not invertible, then such a matrix is 
not uniquely determined. 


Solution. Let us again write A in the form A = PQR, as in theorem 1. Let us 
search for B in the form B = R~'XP™!, This form is natural, since by writing the 
conditions ABA = A and BAB = B, the matrix RBP, denoted by X, plays a central 
role. With this substitution, the conditions that should be satisfied by B are simply 
QXQ = Q and XQX = X. This shows that we can already assume that A = Q! 
Now, let us search for X as a block matrix: ( : a 

The above conditions on X become U = I, and T = WV. Thus, we have surely 
at least one solution (just take X = Q), and moreover, if r < n, there are infinitely 
many choices for the matrices V, W, thus infinitely many such X. Therefore, if A is 
not invertible, it has infinitely many generalized inverses. L 


Problem. Let A € M,,(C) be a matrix such that 


det(A + X) = det(A) + det(X) 


for all X € M,(C). If n > 2, prove that A = O,. 


Solution. Take X = A to deduce that (2” — 2) det(A) = 0; thus, det(A) = 0. 
Therefore, if A Æ On, we can write A = PQR, where P,R € GL,(C) and 
Q= I r = rank(A) € {1,2,...,n2— 1}. Let X = PSR where S = I, — Q. 
Then det(X) = 0 because J,, — Q is diagonal and has at least one zero on the main 
diagonal. However, det(A+X) = det(P)-det(R) 4 0, a contradiction. Thus A = O,,. 
O 


Problem. Let A,B € M,(C). Prove that the map f : M, (©) > M,(C), 
f(X) = AX — XB is bijective if and only if A and B have no common eigenvalues. 


Solution. Because M,(C) is a finite-dimensional C-vector space, f is bijective 
if and only if it is injective. 
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Suppose that A, B have no common eigenvalues and let X # O, be such that 


f(X) = Op, that is, AX = XB. Let X = PQR with P, R € GL,(C) and Q = G l 


l<r<n. 
Write now P~!AP and RBR”! as block matrices 


P'ap = (^4 , RBR! = Bı Ba \ 
A3 A4 B; By 


The condition AX = XB becomes P~'APQ = QRBR™!. This in turn implies 
A3 = 0, By = 0, A; = By. Thus 


pap = (4142 , RBR! = ARD 
0 A4 B; Bs 


By the multiplicative property of determinants in block-triangular matrices and 
the fact that similar matrices have the same characteristic polynomial, it follows that 


XA = XA X44; XB 5 XA, XBa> 


where yz is the characteristic polynomial of Z. But since A} € M,(C), Xa, has at 
least one zero, which will be a common eigenvalue of A and B, a contradiction. This 
shows one implication. 

Assume now that A and B have a common eigenvalue A and observe that one can 
make A and B triangular with À on the first place of the main diagonal by suitable 
similarities, That is, there are P, R € GL,(C) such that 


A X12... Xin A 0... 0 
O x22... X2n yo1 Y2... O 
P'AP=|. | 0 |, RBR'=| ne a 
Oher Xnn Ynl Yn2 +++ Ynn 
But then we easily see that f(X) = 0, with 
1 
0 0 
X=P R#0, 
0 0 


thus f is not injective in this case. O 


We end this theoretical part with a classical result. 
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Problem. Let A € M,C). Find, as a function of A, the smallest integer k such 
that A can be written as a sum of k matrices of rank 1. 


Solution. As you can easily guess, k = rank(A). Indeed, if A has rank equal to 


r, then A = POR for some P, R € GL, (C) and Q = te a Thus 


A=A, +A, +--+ Ay, 


where 


the only 1 being on position (i, i). Clearly, rank(A;) = 1 because 


0 0 


0 0 
has rank 1 and P, R € GL,(C). Thus k < r. 


On the other hand, rank(X + Y) < rank(X) + rank(Y) thus if A = A; +--+ A, 
with rank(A;) = 1, we have 


k 


k 
r = rank(A) = rank > a) Ss > rank(A;) = k. 
i=1 


i=1 


It follows that, actually, k = rank(A). O 
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1. Consider the matrices 


ax bx az bz 


des ab LBS xy C= ay by at bt 
cd zt cx dx cz dz 
cy dy ct dt 


with complex entries. Prove that if rank A = rank B = 2, then rank C = 4. 
2. Let A = (aij) sijen» B= (bi) i<ijen PE 80 that 


ae 
ay = 2 7. bi 


for all integers 1 < i,j < n. Prove that rank A = rank B. 

3. Let A € M,C) be singular. Prove that the rank of the adjugate matrix adj(A) 
(the transpose of the cofactor matrix, sometimes called the adjoint) of A equals 
Oorl. 

4. Let A € M,(C) be with rank(A) = r, 1 < r < n — 1. Prove that there exist 
B € M,,(C), C E€ Mra(C) with 


rank(B) = rank(C) = r, 


such that A = BC. Deduce that every matrix A of rank r satisfies a polynomial 
equation of degree r + 1. 

5. Let A € M,(C), A = (ay)1<ij<en be with rankA = 1. Prove that there 
exist complex numbers x), X2,...,Xn,¥1,)2,--++»)n such that aj = x;y; for all 
integers 1 < i,j < n. 

6. Let A, B € M, (C) be two matrices such that AB = A+B. Prove that rank(A) = 
rank(B). 

7. Let A be a complex n x n matrix such that A? = A*, where A* is the conjugate 
transpose of A. Prove that the matrices A and A + A* have equal ranks. 

8. Let A E€ M,(C) be a matrix with rank(A) = 1. Prove that 


det(Z, + A) = 1 + tr(A). 
Moreover, 
det(AI, + A) = à” +A"! - tr(A), 
for all complex numbers À. 
9. Let A € M,(Z), n > 3, with detA = 1 and let B € M,(Z) have all its entries 


equal to 1. If det(7 + AB) = 1, prove that the sum of all entries of the matrix A 
is equal to 0. 
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10. 


Let A € M,(R), A = (aj), and let X be the square matrix of order n with all 
entries equal to a real number x. Prove that 


det(A + X) - det(A — X) < (det A)’. 


11. Let m < n be positive integers, and let A € Mm n(R) and B € M,,,(R) be 
matrices such that rank A = rank B = m. Show that there exists an invertible 
matrix C € M,,(R) such that ACB = Im, where Im denotes the m x m identity 
matrix. 

12. Let A and B be complex n x n matrices having the same rank and such that 
BAB = A. Prove that ABA = B (i.e., B is a pseudoinverse of A). 

13. Let A and B be complex n x n matrices of the same rank. Show that if A?B = A, 
then B?A = B. 

14. Prove that if A is an n x n complex matrix with zero trace, then there exist n x n 
complex matrices X and Y such that A = XY — YX. 

Solutions 

1. We have to prove that if A, B are invertible, then C is invertible. If 
(ad — bc) (xt — yz) # 0, 
then 
ax bx az bz ax az bx bz 
ay by at bt| _ _ jay at by bt 
cx dx cz dz cx cz dx dz 
cy dy ct dt cy ct dy dt 
a0b0| |xz00 ab00| |xz00 
_ _|Oa0b) |yt00}_ |00ab; |yt 00 
~  fe0d0} |00xz| |cd00| |00xz 
OcOd| |O0yt OO0cd| |O0yt 
ab00| |xz00 
cd00} |yt00 2 2 
er : = —(ad — b t— 0. 
Nore! osa oe eee 
OO0cd| |O0yt 
2. We will show here only that detA = det B, because in the same way, it can 


be proved that every minor of A is equal to the corresponding minor of B. We 
have, using the definition of the determinant, 
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detA = > €(0)A19(1)420(2) ** * Ano(n) 


o€Sy 


= 5y elo) (2) OO Dig ay) (277P ba) e (27 bnon) 


oESy, 


= 5 e(o) 21t2t=+n CM+eQ)+-+o)) . boba brot) 


o€Sy 


= D €(0)b19(1)b20(2) *** bnon) = det B. 


OES 
Alternatively, note that B = DAD !, where 


21 


3. Ifrank(A) < n — 2, then adj(A) = O,, because all minors of order n — 1 of the 
matrix A are equal to zero. Further, let us assume that rank(A) = n — 1. Then 


rank(Aadj(A)) > rank(A) + rank(adj(A)) — n. 
Using also Aadj(A) = O,, we derive 
0 > (n— 1) + rank(adj(A)) —1n 


so rank(adj(A)) < 1. 
4. Let A = PQR, where P, R are invertible and 


[0 
2=(40): 
Because Q? = Q, we have A = (PQ)(QR). Observe that rank(PQ) = 
rank(Q) = r and rank(QR) = rank(Q) = r because P, Q are invertible. Also, 


we can write P = (B 0) and OR = (5) for B € M,,,(C) and C € M,,,(C). 


Clearly, BC = A and rank(B) = rank(C) = r. 
_ For the second part, we use the Hamilton-Cayley theorem. For the matrix 
A = CB €e M,(C), we can find complex numbers a1, ..., a, such that 


A +aA petal =o. 
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By multiplying by B to the left and by C to the right, we obtain 
BA'C+aBA C+- +.a,BC = 0. 
Now, 
BAC = (BC)! =AH!, 1 <k<r, 
so 
AH! aA’ +---+4,A =0. 
5. According to the previous problem, in case r = 1, there exist two matrices 
Be Mna (C), CE Mia) 


so that A = BC. If 


Xı 
x 
B= 21, C= Yn), 
Xn 
then 
Xı X1Y1 X12 --- X1Yn 
Xx XQV X 72 XQ 
Ages 2 (yı y2 ayn) SA 2Y1 X2y2 Yn 
Xn XnY1 Xny2 ... XnYn 


6. The given equality can be written as 
(A—D(B-N) =I, 


so the matrices A — J and B — I are invertible. Now the conclusion follows from 
the equality A = (A — I)B, taking into account that A — J is invertible. 

7. First we prove that J, + A is invertible. Let x be any (column) vector such that 
(I, +A)x = 0 (the 0 from the right is the n x 1 zero matrix). Thus Ax = —x, and 
A?x = —Ax = x and A*Ax = —A*x = —A*x = —x. Because all eigenvalues 
of A*A are nonnegative (indeed, if A*Av = av, with nonzero vector v, then 
(Av)*(Av) = av*v, and v*v > 0, (Av)*(Av) > 0), it follows that x = 0, 
meaning that J, + A is invertible. 
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Next, A+ A* = A+ A? = (l, + AJA, with nonsingular J, + A, and the 
conclusion follows. This is problem 1793 by Götz Trenkler in Mathematics 
Magazine 2/2008 (solved in 2/2009 by Eugene A. Herman). 

8. Let us consider the matrices B, C of rank 1, 


X1 
Bal |e M,nı(©), C= (yı y2 aey) E€ Min(C) 
Xn 
such that 
xı XV X12 - - - X1Yn 
A=BC=|” (yı y2 oe Yn) = en AERE REA 
Xn XnY1 Xny2 +++ XnYn 


We also have A = B’C’, where 


x 0... 0 oo 
Be hes ne eM O, SCPE OT Ne AO. 
Bes EN 0... 0 


Then 


det(, + A) = det(I, + B’C’) = det(I, + C’B’) 


1+) xy 0... 0 
k=1 


= 0 1... 0|=1+ >) xor = 1+ tr). 
k=1 


The other equality can be obtained by changing A with A~!A. Indeed, 


det(, + A7!1A) = 1 + tr(A71A) 
1 1 
> z detA + A) = 1 + 5 tr(A) 


> det(Al, + A) =A" +A"! tr(A). 
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9. 


10. 


11. 


The matrix A is invertible, so 
rank(AB) = rank(B) = 1. 


A surprising property is that the sum of all entries of the matrix A is equal to 
tr(AB). Indeed, with the notation C = AB, we have 


n n 
cij = 5 AnD = >» ik; 

k=1 k=1 
for all 1 < i,j < n. Thus 


tr(AB) = tr(C) = y Ci = 5 (© au = 2 dik, 
i=1 1 


i=1 \k= ik=1 


which is the sum of all elements of the matrix A. 

By a previous problem, we have tr(AB) + 1 = det(/ + AB) = 1, so 
tr(AB) = 0. 
First assume that A is invertible. In this case, 


det(A + X) - det(A — X) = (det A)’ det(J, + A~!X) det(J, — A7!X), 
with 
rank(A~!X) = rank(X) = 1. 
Consequently, 


(det A)? det(J, + A~'X) det(, — A7!X) 
= (detA) [1 + tr(A7!X)][1 — tr(A7!X)] 
= (det A)’ [1 — (tr(A™'X))"] 
= (det A)? — (det A)*(tr(A7!X))? < (det A)”. 


Now we can see that the result remains true under the weaker condition 
rank(X) = 1. In case detA = 0, replace A by A + el, and let £ — 0. 

If m = n, there is nothing to prove (just choose C = A~!B™'), so we consider 
further that m < n. 

Let P be an n x n permutation matrix such that the determinant of the 
submatrix of AP with entries at the intersections of its m rows and first m 
columns is nonzero. Let M be the (n — m) x n matrix consisting of two blocks 
as follows: 
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and let A, be the n x n matrix 


using Binet’s rule for computing determinants, one sees that det A, 4 0; hence 
A, is invertible in M,,(R). 

Similarly, because B has rank m, there exists an n x n permutation matrix Q 
such that QB has a nonsingular submatrix with entries at the intersections of its 
first m rows and its m columns. Let 


N = ( On, a) 
In—m 


Bı = (QBN); 


and 


one sees that B; is an invertible n x n matrix. 
We consider C; = AP Bee thus we have 


AP APC\QB APC\N 
In =A CıB = C = ; 
ees a) i (QBN) eo eed 


whence (by reading the equality for the upper left m x m corner) Im = APC,QB 
follows. Now, for C = PC,Q (which is an n x n matrix), we get ACB = Im and 
the proof is complete. 

This is problem 335 Proposed by Vasile Pop in Gazeta Matematicd, seria A, 
1-2/2011. 
We consider invertible n x n matrices M, N, P, Q such that A = MJN and B = 
PJQ, where J is the n x n matrix having the identity matrix J, (of order r, the 
rank of A and B) in the upper left corner and all the other entries equal to zero: 


From BAB = B, we conclude PIQMJNPJQ = PJQ; hence JOMJNPJ = J. 
We write QM and NP in the form 


RS 
om= (75) 
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13. 


14. 


and 


where R and V are r x r matrices, S and X are r x (n — r) matrices, T and Y 
are (n — r) x r matrices, and U and Z are (n — r) x (n — r) matrices. Then the 
equality JQMJNPJ = J becomes 


I, O,, n-r RS L O,, n= \ 
On-r, r On-r TU On-r, r On-r 
p VX I, O;, n—r \ _ L, O,, n—-r 
YZ On=r, r On-r - On-r, r On—r 


and it yields RV = J,. Of course, this implies VR = I[,, too, and this is 
equivalent to JNPJQMJ = J, or MJINPJQMJN = MJN, that is, ABA = A. 
Solution I. For X € M,(C), we also denote by X the linear transform (from 
C” to C”) having matrix X; then ker(X) and im(X) represent the kernel (null 
space) and the range of this linear transform, respectively. Since A7B = A, 
ker(B) is contained in ker(A); having equal dimensions (because of the equality 
of the ranks), these subspaces of C” (of which one is a subspace of the other) 
must be equal. From the equality A(AB — In) = On, it follows that im(AB — In) 
is included in ker(A); therefore it is also included in ker(B); this yields B(AB — 
In) = On, which means that BAB = B. According to the result of the previous 
problem, ABA = A follows. 

We can rewrite this as A(BA — In) = O, and thus conclude that im(BA — [,,) 
is included in ker(A) = ker(B); therefore B(BA — I,,) = On, which is equivalent 
to B?A = B, and finishes the proof. 


Solution II. We have rank(A) = rank(A*B) < rank(A”) < rank(A); hence 
rank(A”) = rank(A). Because the null space of A is contained in the null space 
of A’, it follows that these spaces are equal. Also, ker(B) = ker(A), as we saw 
in the first solution; hence ker(B) = ker(A) = ker(A’). 

Now A?B = A implies A*BA = A’, or A?(BA — I,) = O,, showing that 
im(BA —I,) is contained in ker(A”) = ker(B). Consequently, B(BA —I,) = On, 
which is equivalent to B?A = B. 


This is problem 11239 proposed by Michel Bataille in The American 
Mathematical Monthly, 6/2006. The second solution (by John W. Hagood) was 
published in the same magazine, number 9/2008. 

Note first that if a,,...,a, are complex numbers that sum to 0, there exist 
distinct complex numbers x;,...,X, and a permutation o of {1,...,2} such 
that a; = x, — Xo (x) for every 1 < k < n. Indeed, this is clearly true for n = 1 
(when there is nothing to prove) and for n = 2 (when we can write aj = x1 — x2 
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and a = x2 — x1, with x, Æ x2 if a; Æ 0, while if a; = 0, then ay = 0, too, 
and we can take aj = 0 = xı — xı and a = 0 = x) — x2, with x, and x 
chosen to be different). Further, we use induction on n (a quick look over the 
case n = 2 shows what we need to do). Suppose that n > 2 and the assertion is 


proved for any m numbers, where m < n. Let a),...,a, be complex numbers 
such that a; +--+ + a, = 0. Suppose, in the first place, that {a,,...,a,} has no 
proper subset that sums to 0. We can find x;,...,x, such that a, = xp — Xk+1 
for every 1 < k < n (with x,4; = xı), and the supplementary assumption 
assures that all x, are distinct. (Just put, in an arbitrary way, a) = xı — X2, 
then a? = X2 — X3,..-, An—1 = Xn—-1 — Xn, for x3,...,%, that are precisely 
defined; a, = x, — xı follows from a; + --- +a, = 0; if x, = x), with 


l1 < k <I<n47,thna+---+a_, = 0 follows, but this is not allowed 
in this case.) So we are done by considering the cyclic permutation o defined 
by o(k) = k + 1 for all k < n and o(n) = 1. Otherwise, there exists such a 
proper subset of {a;,...,d,} with sum of elements 0. We can assume, without 
loss of generality, that a; +--+: + am = 0, where 1 < m < n. But we also have 
a; +++: +a, = 0; hence am+1 +++: + an = 0, too. Now m and n — m are both 
less than n; therefore, we can use the induction hypothesis to get a, = Yk —Ya(K) 
for all 1 < k < m (with y,,...,¥,, mutually distinct and z a permutation of 
{1,...,mj) and a, = Zk — Zk) for all m + 1 < k < n (with %n41,....2n 
mutually distinct and t a permutation of {m + 1, ...,n}). If we further choose 
some t different from all numbers y; — z; (1 < i < m,m + 1 < j < n), then we 
have a, = xk — Xo(kķ) for all 1 < k < n, and xı, .. . , X, are all distinct, if we put 
x; = y; and o (i) = z (i) forall 1 < i < m and x; = z + t and o (j) = t(j) for 
everym+1<j<n. 

Now, in our problem, we are given a matrix A = (aj) )<ij<n With tr(A) = 0, 
that is, aj; +++: + dy, = 0. By the above observation, there exist distinct 
X1, - - -, Xn and a permutation o of {1,...,} such that a; = x; — x, for every 
1 <i < n. Thus we can write the matrix A in the form A = B — C, with 


X1 0 ...0 Xo(1) T412 +++ ~@in 
B AILAI va 0 end C= 0 Xo(2) +++ An 
Anl an2 ... Xn 0 0 +++ Xo(n) 


Because B and C are triangular, their eigenvalues are the elements on their main 
diagonals; therefore both B and C have the distinct eigenvalues x1, . .. , Xn. It is 
well-known (and very easy to prove, if one uses, for instance, the canonical 
Jordan form) that B and C are similar, that is, there exists an invertible matrix P 
such that C = PBP~'. Then we have A = B — PBP™! = (BP~')P — P(BP“), 
which is exactly what we wanted to prove. 

However we won’t stop here, because we intend to effectively prove the 
similarity of B and C with the help of elementary operations (and elementary 
matrices). We first show that B can be transformed by similarity into a diagonal 
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matrix having on its main diagonal the same entries as B. To this end, the 
important observation is that W;(a@)BW,(a@)~! = W;;(w)BW;(—a@) is a matrix 
that, when compared to B, has only the entries on row i and column j changed; 
moreover, for i > j, the entry in position (i, j) is replaced by a(x; — xi) + ay. 
If we choose a = —aj(xj — x;)~! (which is possible because x; Æ x;), the new 
matrix W;(a)BW; (a)! has the same elements as B except for those on row i 
and column j, and it has 0 in position (i, j). Now we proceed as in the proof of 
theorem 1, only we use similarities at each step. Namely, we first transform all 
elements that are on the first column under x; into zeros, beginning with ap 
and finishing with a,: we first consider W21 (x)BW21 (œ)! which is a matrix 
with the same elements as B, except for the elements in the second row and first 
column. Note, however, that the entries in the second row do not change, due to 
the zeros from the first row, except for the entry in position (2, 1) that becomes 
O for an appropriate œ; yet, the element in position (1, 1) remains unchanged, 
because of the zeros from the first row. Then multiply this new matrix to the 
left with W3; (œ) and to the right with its inverse, in order to obtain a matrix 
that differs from B only by the elements on the first column situated under the 
second row and has one more zero in position (3, 1) for suitable œ, and so on. 
After we finish with the first column, we pass to the second, where we change 
all entries under x2 into zeros by using the same type of similarities (by using, 
in order, W32,..., W,2), and we continue in this manner until we transform 
Gnn—1 into 0. (Every time we apply some W; to the left and Ww," to the right 
(i > j), only the elements in positions (i, j), .. . , (n, j) change, and the entry in 
position (i, j) becomes 0 if we choose well the value of œ.) At this moment, we 
transformed (by similarities) B into a diagonal matrix B’ that has on the main 
diagonal precisely the same elements as B. 

In an analogous manner, we can show that C is similar with a diagonal matrix 
C’ having the same entries as C on its main diagonal. Because B’ and C’ have 
the same elements on the main diagonal (possibly in a different order), and 
they have all the other entries equal to 0, they are similar—use matrices of 
the form Uj in order to see that (U,B’ Uj z= U,B’ Uj only interchanges the 
entries in position (i, i) and (j, j), and, since any permutation is a product of 
transpositions, the conclusion follows). Finally, by the transitivity of similarity, 
we conclude that B and C are similar, and thus we have a complete proof (that 
needs no Jordan form or any other advanced tools) of this fact and consequently, 
we have an elementary solution to our problem. 


Remark. For the sake of (some kind of) completeness, here is the sketch 
of a little more advanced proof of the similarity of two (not necessarily 
triangular) matrices with the same distinct eigenvalues. Let x;,...,x, be the 
distinct eigenvalues and let U;,..., U, be eigenvectors of B corresponding to 
X1,...,X, respectively, and let V;,..., V, be eigenvectors of C corresponding to 
X1,...,X, respectively (column vectors, of course). Thus we have BU; = x;U; 
and CV; = x;V; for every 1 < i < n. Because x,,...,x, are distinct, the 
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eigenvectors U;,...,U, are linearly independent and Vj,...,V, are also 
linearly independent; hence, the matrices (U;...U,) and (V;...V,) are 
invertible. Let P be the matrix that changes the basis from {U,,...,U,} to 


{V1,..., Vn}, namely, such that P(U; ... Un) = (Vi ... Vn). Then P is invertible 
too, and PB = CP, that is, C = PBP™!. 


Chapter 6 
Equivalence Relations on Groups 
and Factor Groups 


Let (G,-) be a group with identity denoted by e and let @ # H C G satisfy the 
implications: 


a) x,y € H > xy cd; 
b xeHax'ed. 


Such a subset H is called a subgroup of G, which will be denoted by H < G. For 
every x, y E G, we put 


xR'y(mod H) & x ly € H 
and 
xR'y(mod H) & xy"! € H. 


We will prove that the relations R! and R” are equivalence relations on G, also 
called the left equivalence relation modulo H, and the right equivalence relation 
modulo H, respectively. We will write simply xR'y and xR"y when no confusion is 
possible. 

Indeed, xR!x because x~!x = e € H. If xRy, then 


1 


xy e€ H > (xy)! e€ H > y'x € H > yR'x, 


so R! is reflexive and symmetric. For transitivity, let x, y, z € G be such that 


xR!y a xlyeH 
yR'z ylzeH ` 


H is a subgroup, so 


TIYO!) € H 6 x lyy zE H e x! eE H S xR'z. 
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For x € G, we denote by X the equivalence class of the element x, 
X = {y € H | yR'x}. 


Analogously, we put x” = {y € H | yYR”x}. When the difference between the two 
relations is not important, we denote simply £ one of the equivalence classes X or 
X. For every x € G, we have x = xH,X" = Hx, where xH = {xh | h € H} and 


Hx = {hx | h € H}. 


Theorem. Let G be a group and let H be a subgroup of G. Then for every 
x,y € G, the sets xH, yH, Hx, Hy have the same cardinality. 


Proof. We prove that the sets xH and Hx are cardinal equivalent with H. Indeed, 
the maps a : H —> xH, B : H > Hx given by the laws a(h) = xh, p (h) = hx, h € H 
are bijective. A classical proof can be given or we can simply note that the inverses 
a! : xH —> H, and $7! : Hx —> H are given by a~!(xh) = h, and 67! (hx) = h, 
respectively. 

Denote by 


G/R! = {xH | x€ G}, G/R' = {Hx|x e€ G} 


the set of equivalence classes with respect to R! and R”. The sets G/R! and G/R" 
have the same cardinality; the map 


$: GIR! > G/R', $ (xH) = Hx! 


is bijective. First we prove that @ is well defined. More precisely, we have to prove 
the implication 


xH =H => Hx"! = Hx. 
Indeed, we have 
xH = xH > xx CHS (xe!) ') eE H S Ax! =H. 


By its definition law, ¢ is surjective. For injectivity, let x,x’ € G be such that 
o(xH) = ġ(x'H). Then 


Hx = Hx! & (xx e H ex EH e xH = xH. oO 


A subgroup N < G is called a normal subgroup of G, and we denote this by 
N < G, if one of the following equivalent assertions is true: 


(n’) for every x € G andn € N, xnx™! € N 
(n”) for every x € G, xN = Nx 
(n) for every x € G and n € N, there exists n’ € N such that xn = n'x. 


6 Equivalence Relations on Groups and Factor Groups 95 


We prove now that the assertions (n’) — (n’”) are equivalent. 
(n) = (n"). We prove that xN C Nx, because the reverse inclusion is similar. 
Let xn € xN be an arbitrary element, with n € N. Then 


xn = (xnx~')x € Nx, 


taking into account that xnx~! € N. 

(n") => (n”). Forx € Gandn € N, xn € xN. But xN = Nx, so xn € Nx. There 
exists an element n’ € N such that xn = n’x. 

(n”) => (n’). For x € Gandn €E N, letn’ € N such that xn = n’x. It follows 
xnxx! =n ENO 


Example. If G is commutative, then any subgroup of G is normal. 


Example. Let G, G’ be groups and let f : G > G’ be a morphism. The kernel 
kerf = {x € G| f(x) = e'} 


is a normal subgroup of G (e’ is the identity of G’). 
First, if x, y € kerf, then f(x) = f(y) = e and 


FD =f@FO) =f@FO)) Ss ee =e. 


Hence xy! € kerf, which means that kerf is a subgroup of G. Further, for every 


x € Gandn € kerf, we have xnx_! € kerf. Indeed, 


SEn’) = FOAMS = fe) -e far) 
= fof) = fx!) = f(e) = e. o 


If N is a normal subgroup of G, N < G, then the left equivalence class and the 
right equivalence class are identical. Consequently, the sets of equivalence classes 
G/R! and G/R’ coincide. We denote by G/N the factor set G/R! = G/R". Now 
we also denote 


X= xN = Nx. 


For all classes x, y € G/N, we define the operation 


First note that this operation is well defined. If * = and y= y, we must have 
xy = xy’. Indeed, 
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On the other hand, 


Gy) '@’y) =y lx try =y'@'¥)O/y Dy = yny EN, 


where n = (x!x’)(y'y~!) € N. Hence 4y = xy’, 
The element è = eN = Ne = N € G/N is the identity since 


The group (G/N, -) is called the factor group with respect to the normal subgroup N. 


Theorem (Lagrange). Let G be a finite group. Then for every subgroup H of G, 
we have ord H which divides ord G, where ord G is the cardinality of G. 


Proof. Let us consider the left equivalence relation R with respect to H. 

In general, any two different equivalence classes are disjoint. Because G is finite, 
there are only a finite number of equivalence classes, which define a partition of G, 
say 


G=xHUxH U.. U xH, 
with x, € G, 1 < k < n. As we proved, 


þa H| = |x2H| =--- = |x, H| = ord H, 


so by taking the cardinality, we derive 
ord G = |x H| + |xoH| + +++ + |xnH| = n- |H]. 


Hence ord G = n- ord H. O 


The number n of the equivalence classes is called the index of H in G, denoted 
by n = [G : H]. Therefore the relation 


|G| = [G : H] - |H| 


will be called Lagrange’s relation. 


The fundamental theorem of isomorphism. Let G and G' be groups and f : 
G — G be a morphism. Then the factor group G/ kerf is isomorphic to Imf. 


Proof. The proof is constructive, in the sense that we will indicate the requested 
isomorphism. We are talking about the map 
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p:G/kerf > Imf 
given by p(x) = f(x), where x € G. 
First we prove that p is well defined. To do this, let us consider x,x’ € G with 
x = x’. Thus 
=x Sxl e kerf © f(x 'x) =e! 
SSO YE) =e & FW) FO) = e. 


By multiplying to the left with f(x), we deduce that f(x) = f(x’). 
Now we use the fact that f is a morphism to prove that the map p is a morphism, 


E-D = p(xy) = fry) = fF) = eO). 


If x,y € Gare so that p(x) = p(), then 


f@) =f) e Sf =e 


S fO = e sfx = e oxy! eker e=, 


so f is injective. By its definition, f is also surjective. LI 


We end this chapter with some important and nontrivial results that can be also 
obtained using equivalence relations. 


Theorem (Cauchy). Let G be a group of order n and let p be a prime divisor 
ofn. Then there exists an element of order p in G. 


Proof. Consider S the set of p-tuples (x1, x2, ..., Xp) such that x,x)---x, = e. 
Clearly, this set has nP™! elements, because for any choice of x1, X2, .. . , Xp—1, the 
element x, is uniquely determined. Moreover, we can define a function f from S 
to S by f(x1,X2,...,Xp) = (X2, X3,-  . , Xp, X1), because if x1x2 ++ Xp = e, we also 
have x2x3+++Xpx; = e (due to the fact that xı = (x:- -Xp 7D). This function f is 
manifestly injective and thus it is a permutation of S. Also, we have f? (x) = x for all 
x € S, (where fP denotes the composition taken p times). Now, define an equivalence 
relation by xRy if and only if there exists an integer k such that x = f* (y). Because 
fP (x) = x for all x, any equivalence class has either 1 or p elements. But the sum 
of the cardinalities of the equivalence classes is the cardinality of S, which is a 
multiple of p. Because the class of (e, e, . . . , e) has one element, it follows that there 
is some (x1, X2, . - . , Xp) € S, different from (e, e, .. . , e) and whose equivalence class 
also has one element. But this implies (x1, %2,...,%)) = f(%1,%2,...,Xp), that is 
X1 = X2 = +++ = Xp = x and this element x satisfies x”? = e because (x),...,x») E€ S 
and x Æ e. Thus, x has order p. O 
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And here is an application: 


Problem. Prove that any commutative group whose order is square-free is 
cyclic. 


Solution. Let G be such a group, whose order is pip2...pn for some distinct 
prime numbers p1, p2,...,Pn. By Cauchy’s theorem, there are elements x; whose 
oruot equals p; for all i. We claim that y = x1x2 . . . Xn has order pıp2 . . . pn. Indeed, 
if y* = e, then y'?2--P» = e, and because G is abelian: it follows that x!" KaTa 
But the order of x; is pı, weeds is relatively prime to p2p3... Dn. Thus pı divides 
k. Similarly, all numbers p1, p2, ...,pn are divisors of k ond so k is a multiple of 
Pip2---Pn- This shows that G has an element whose order is at least equal to the 
order of the group, so G is indeed cyclic. L 


It is very easy to prove, using Lagrange’s theorem, that any group whose order is 
a prime number is cyclic. The following result is however much more difficult: 


Problem. Prove that any group whose order is the square of a prime number is 
abelian. 


Solution. Let G be a group of order p*, where p is a prime number. Also, 
consider the following equivalence relation on G: xRy if and only if there exists g € 
G such that y = gxg™!. For any x, it is easy to check that the set C(x) = {g|gx = xg} 
is a subgroup of G. We claim that the cardinality of the equivalence class of x equals 


Indeed, it is not difficult to check that the function f : G/C(x) — [x] (here 
-1 


Can 
[x] denotes the equivalence class of x) defined by f(gC(x)) = gxg~" is well defined 
and bijective: if gı C(x) = g2C(x), then gı = goc for some c € C(x) and so 


81X81 | = gocxc” | gy! = 82x83 


because cxc™! = x. Also, if gxg~! = hxh™!, we have h™!g € C(x) and so gC(x) = 
hC(x). Ahes f is injective. Obviously, f is surjective. This proves the claim that 


\[x]| = iat Now, G is the disjoint union of the equivalence classes, so p? equals 


the sum of ‘the cardinalities of all classes. By the above argument, any class has a 
cardinality equal to 1, p or p° and a class has cardinality 1 if and only if C(x) = G 
that is, x commutes with all elements of the group. Let Z(G) the set of such x. The 
previous remark implies that p | |Z(G)|. We need to show that Z(G) = G, so it 
is enough to show that we cannot have |Z(G)| = p. Let us assume that this is the 
case. Then G/Z(G) is a group (clearly, Z(G) is normal) with p elements, thus cyclic. 
Let p(a) be a generator of this group, where p is the natural projection from G to 
G/Z(G). Then we know that for any x,y € G, there are integers k,m such that 
p(x) = p(a*) and p(y) = p(a”). It follows that a~*x and a™™y are in Z(G) and this 
easily implies that xy = yx. Thus, any two elements of G commute and Z(G) = G 
contradicting the assumption that |Z(G)| = p. This contradiction shows that we 
must have Z(G) = G and G is therefore abelian. C 
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Proposed Problems 


10. 


11. 


. Let G be a finite group, H a subgroup of G, and K a subgroup of 


H, K < H < G. Prove that 


[G : K] = [G : H] - [H : K]. 


. Let G be a group and let H be a subgroup of index 2 in G, [G : H] = 2. Prove 


that H is a normal subgroup of G. 


. Let G be a group, H a subgroup of G, and let x, y € G. Find a bijective function 
from xH onto yH. 

. Let G be a finite group. Prove that for every x € G \ {e} with ordx = k, we 
have k divides ord G. 


. Let G be a group and N a normal subgroup of G. Assume that an element 


a € G has finite order, orda = n. Prove that the least positive integer k for 
which a‘ € N is a divisor of n. Prove that this remains true if N is any subgroup 
of G (not necessarily a normal subgroup). 


. Let G be a cyclic group, G = {x* |ke Z} , for some element x € G. Prove that 


G is isomorphic to the additive group of the integers (Z, +) or to an additive 
group Z/nZ, for some integer n > 2. 


. Let G be a group with identity e such that f : G > G, given by f(x) = x" isa 


morphism for some positive integer n. Prove that 
H={eG|xX*=e}, K={x"|xeG 


are normal subgroups of G. Moreover, prove that if G is finite, then ord K = 
[G : H]. 


. Consider the additive group (R, +) of real numbers with the (normal) subgroup 


(Z, +). Prove that the factor group R/Z is isomorphic to the multiplicative 
group T of complex numbers of module 1. 


. Consider the additive group (C,+) of complex numbers with the (normal) 


subgroup (R, +) . Prove that the factor group C/R is isomorphic to the additive 
group (R, +) of real numbers. 

Let G be a group with the property that x? = e, for all x € G. Prove that G is 
abelian. Moreover, prove that if G is finite, then ord G is a power of 2. 

Let G be a group. Denote by (Inn G, o) the group of inner automorphisms, 


InG = {t,|g€G}, 


where t, : G — Gis the automorphism 


Tex) = exe; 
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12. 


13. 


14. 


15. 
16. 


17. 


18. 
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for all x € G. Prove that the group (Inn G, o) is isomorphic to G/Z(G), where 
Z(G) < G is the center of G : 


Z(G) = {x E€ G| xy = yx, Vy E G}. 


Let G be a group and let N be a subgroup of Z(G). Prove that: 


a) N is anormal subgroup of G; 
b) if G/N is cyclic, then G is abelian. 


Let p > 3 be prime. Prove that the set of integers 0 < a < p — 1 for which the 
congruence 


xX = a(mod p) 


+1 
P elements. 


has solutions in the set of integers consists of 


Let us consider the group (Z,+) as a subgroup of the group (Q,+) of the 
rational numbers. Prove that every finite subgroup of the factor group Q/Z is 
cyclic. 

How many morphisms f : (Z2 x Z2, +) — ($3, 0) are there? 

Let (G, -) be a finite group with n elements, and let S be a nonempty subset of 
G. Prove that if we denote by S* the set of all products sı ++- Sg, with s; € S for 
all 1 < i < k, then S” is a subgroup of G. 

Let (G, -) be a finite group whose order is square-free, let x be an element of G, 
and let m and n be relatively prime positive integers. Prove that if there exist y 
and z in G such that x = y” = z”, then there also exists t in G such that x = t”. 
Let A be a nonsingular square matrix with integer entries. Suppose that for every 
positive integer k, there is a matrix X; with integer entries such that Xk =A. 
Prove that A must be the identity matrix. 


Solutions 


l. 


2. 


By Lagrange’s relation, 


meta ea Gl. Elig. 
GH) [HK] = ae = = IG K. 


We have [G : H] = 2, so there are only two left (or right) equivalence classes 


which define a partition of G. One of the sets of the partition is always H, so 
the partition is G = H U (G \ H). Then H is a normal subgroup of G, because 


G/R! = GJR" = {H,G \ B}. 
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3. We will prove that the application œw : xH — yH given by the law w(xh) = yh, 
h € H is bijective. By its definition, w is surjective. For injectivity, if @(xh) = 
w(xh’), then 


w(xh) = w(xh') > yh = yh! 


> yl (yh) = ylh) > h = H > xh = xh. 


4. Let us consider the set 
H= fe, x, x, si ae ‘ 


Any two elements of H are different. Indeed, if there exist integers 0 < i < 
j < k— 1 such that xi = x, then w! = e, with O < j— i < k. But the last 
relation contradicts the fact that ordx = k. We also have the representation 
H= fy |ie Z} . Hence for any elements x’, x/ € H, it follows 


xi)! =x -x =x EH, 
which means that H is a subgroup of G. According to Lagrange’s theorem, 
ord H | ordG & k| ord G. 


5. Let us consider the factor group G/N. We have a” = e € N, soa" = èin G/N. 
We also have a‘ € N, soa = @in G/N, where k is the minimal value with this 
property. In other words, k is the order of the element & in G/N. We also have 
@ =€@, so k divides n. 
Let’s now assume that the subgroup N is not normal. For the sake of 


contradiction, let us put n = kq + r, with 1 <r<k—1. We have 
aaa" 4= d" . (a4 = (a) EN, 


because a% € N. We obtained a” € N, with r < k, which contradicts the 
minimality of k. Hence k = 0, which means that k divides n. 

6. The application f : (Z, +) —> G given by f(k) = x*, k € Z is a surjective 
morphism, 


Fikk) Ht = kK = (SK). 


According to the fundamental theorem of isomorphism, Z/ kerf ~ G, where 
Imf = G. Iff is injective, then kerf = {0}; thus 


Z/ kerf ~ Z/ {0} > Z, 


so Z ~ G. If f is not injective, then let k < k’ be integers with 
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fi =f®) ok =X ox * =e. 
If n is the least positive integer with x” = e, then kerf = nZ. Hence 


G ~ Z/ kerf ~ Z/nZ. 


. In general, the kernel of a morphism is a normal subgroup, and here, H = 


kerf < G. For any elements x", y” € K, we have 
LON = 107D" = Gy)" EK, 


so K is a subgroup of G. It is also normal, because for every x” € K and y € G, 
we have 


yy = OTOT) e ay) = 0T" € G. 


For the second part of the problem, we use the fundamental theorem of 
isomorphism, 


G/ kerf ~ Imf 
and taking into account that kerf = H, Imf = K, we obtain G/H ~ K. Hence 


|G/H| = |K| & ord K = [G: H]. 


. Let us consider the application f : (R, +) —> (C*,-) given by f(x) = e7!. 


21x 


Clearly, f is a morphism and x € ker(f) if and only if e = |, that is, x is 
an integer. On the other hand, it is clear that Imf = T. By the fundamental 
theorem of isomorphism, we deduce that 


R/kerf ~ Imf © R/Z x T. 


. Let us consider the application f : (C, +) —> (R, +) given by f(z) = Imz, 


where Im z denotes the imaginary part of the complex number z. Clearly, f is a 
morphism and kerf = R. Obviously, Imf = R. By the fundamental theorem 
of isomorphism, we deduce that 

C/kerf ~ Imf & C/R >R. 
For every x, y € G, we have x? = e, y? = e, (xy)? = e. Thus 

(ay)? = PY & xyxy = YY. 


By multiplying with x~! to the left and with y7! to the right, we obtain 
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12. 


x" ayyy! = x" Gayy)y! © yx = ay, 
so G is abelian. 

For the second part, we will use Cauchy’s theorem: let G be a group of order 
n in which any element has order at most 2. If p > 2 is a prime divisor of n, 
by Cauchy’s theorem, G has an element of order p, which is a contradiction. 
Therefore any prime divisor of n is 2 and n is thus a power of 2. 
Let us consider the application ¢ : G > Inn G given by the law ¢$(g) = Tg. For 
every g,g’ € G, we have 


$(g) ° 4 (8) = (88. 
Indeed, for all x € G, we have 


1 


Te (Ty (x)) = Tglg' xg T) = glg'xg Tg! = (88x8 N] = Teg x), 


thus 


Tg © Ty = Tye’ > $(g) 0 P8) = p (88). 
Hence ¢ is a (surjective) morphism. If g € ker ġ, then 
$(g) = 16 > t = Ie. 


It follows that for every x € G, 


1 


Talx) = x $ gxg =X $ gx = X8, 


which is equivalent to g € Z(G). Then with ker = Z(G), 
G/kerġ ~ InnG © G/Z(G) ~ InnG. 


a) Every element of N is also an element of Z(G), so every element of N 
commutes with all elements of the group G. Then for every x € G and 
neN, 


anx | = nxx! =n € N, 


so N is a normal subgroup of G. 
b) Let us assume that G/N =< @ >, for some a € G. Now let x € G be 
arbitrary. We have x €< @ >, so there is an integer k such that 
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Hence xa~* = n & x = an, for some n € N. Finally, for every x,y € G, 
let k,p € Zand n, q € N such that x = afn, y = aq. It follows that 


xy = adn -æq = aang = a‘? ng 


and 
yx = æq- a*n = a’akqn = a ** qn, 
SO xy = yx. 
13. Let us define the application f : Z% — Z% by the formula f(x) = x’. Easily, f is 
a morphism from the group (Z5, ) to itself with kerf = {7 pi} . Indeed, if 


F € kerf, then 32 = 1, so p divides x? — 1. The number p is prime, so p divides 
x— l or p divides x + 1. Thus x = 1, respectively x = p — 1. Now, according 
to the fundamental theorem of isomorphism, 


Imf ~ Z% / kerf. 
In particular, 


card Z5 p-1 
cardImf = card(Z;/ kerf) = 


card ker f a) 
If we add the solution a = 0, we obtain 
p-1l p+1 
——— + 1 = —— 
2 2 


elements. 
14. Let H be a finite subgroup of the group Q/Z with ord H = n. We can assume 
that 


H = {G,@2,..., Gn}, 
for some rational numbers a), 42,...,d, € [0, 1). Then for every h € H with 


h € Qn [0, 1), we have 
odH -h=08n-h=08 mheZ. 


It follows that h = k, for some integer 1 < k < n — 1. We proved that 


aT on-l 
HC 40,-,..., : 
n 


n 


The other inclusion is true, because ord H = n. In conclusion, H = ( 


Sle) 
a 
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The answer is 10. Each element of the group Z2 x Znz is of order at most 2, so 
(f(x))? = e, for all x € Zz x Zp. It follows that 


Imf £ {e, (12) , (23), (31)}. 
If for some x, y € Z2 x Z2, f(x) = (12), f(y) = (23), then 


f(x +y) =f@FO) = (12) (23) = (231) ¢ Imf, 


a contradiction. Hence f(x) = e or Imf = {e, tT}, where t € S3 is transposition. 
According to the fundamental theorem of isomorphism, 


Imf > (Z2 x Z2) / kerf. 
In particular, 


2 = card Imf = card [(Z2 x Z2) / kerf] = Cees). 
card ker f 

so kerf has 2 elements. One of them is (0,0) and the other can be chosen in 
three ways. 

In conclusion, there are 3 x 3 = 9 such morphisms, and if we add the null 
morphism, we obtain ten morphisms. 
Fix some s € S. We have sS* C S**!; hence |S*| = |sS*| < |S**!| for all 
k € N*, where |X| denotes the number of elements of the (finite) set X. Thus 
we have the nondecreasing set of cardinalities 


|S] < |S°*| < |S] <--. 


Note that if, for some j, we have |S] = |S/T!|, then |S*| = |S**!| for all k > j. 
Indeed, we have s5 C S/T! and |sS/| = |S] = |S! |; thus sS = S/T! which 
implies sS/t! = sSiS = Sit!§ = Si*?, yielding |ST!| = |sS/t"| = |S/*?|, and 
the conclusion follows inductively. 

Now, if in the sequence of inequalities |S] < |S?| < --- < |S”| we have 
equality between two consecutive cardinalities, then |S*| = |S**!| for all k > j, 
with j < n— 1. Otherwise (when all inequalities are strict), we get |S”| > n; but 
|S"| < |G| = n; thus |S”| = n, and this yields |S*| = n for all k > n. Either 
way, we have 


|S" | = Js”+!] = [sets eee 


As we have seen before, this also leads to sS* = S*t! for all k > n, and, 
by iterating, we obtain s’S” = S?", that is, S° = S” = S"S" (as s” = e, 
the unit element of G, as a consequence of Lagrange’s theorem). Thus S” is 
closed under the multiplication of G, and the fact that it contains the identity is 
obvious: e = s” € S”; that is, S” is a subgroup of G. 
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This appeared as (a folklore) problem B in Nieuw Archief voor Wiskunde, 
issue 3/2007. The above solution is from the same magazine, 1/2008. 
Let o(g) denote the order of an element g € G. From x = y”, we infer that 


Ae rs 
u 


where u = (o(y), m) is the greatest common divisor of the order of y and m. 
We have that u divides the order of y, which in turn divides the order |G| of the 
group; thus, u divides |G|. Also o(x) = o(y)/j and m/u are relatively prime. 

Similarly, for v = (0(z), n), we have that v divides |G| and that o(x) and n/v 
are relatively prime. 

Also, u and v are themselves relatively prime, since they are divisors of m 
and n, respectively; therefore their product divides the order of the group. So, 
we can write |G| = vp, with integer p, and, because |G] is square-free, any 
two of u, v, and p are relatively prime. 

Now x’? = y™P = e (the identity element of G), because mvp = (m/4)|G| 
is a multiple of the order of the group, and similarly, x“? = z"“? = e. It follows 
that both numbers up and vp are divisible by o(x), the order of x, and therefore, 
their greatest common divisor p is also divisible by o(x). As p is prime to both 
u and v, the same is true for the order of x. So, in the end we get that o(x) 
is prime to any of the numbers m/u, n/v, p, and v, hence it is prime to their 
product mn. Again, we can find integers p and q such that 1 = po(x) + qmn; 
thus we have 


x= PO) amn = (E = pm 


for t = x1 € G. 


Remark. Let (M, -) be a monoid. We say that M has property (P) if, whenever 
m and n are positive integers, and x € M has the property that there are y and z 
in M such that x = y” = z”, there also exists t € M for which x = rt!" (where 
[m, n] is the least common multiple of m and n). The reader will easily prove that 
commutative finite groups have property (P). Also, by using the result of this 
problem, we can prove that finite groups with square-free order have property 
(P). We say that a ring has property (P) if its multiplicative monoid has it. Then 
factorial rings, like the ring of integers, have property (P), and with a more 
elaborate argument, one can prove that any factor ring of a principal ring has 
property (P). The proof is essentially the same as in the case of factor rings 
Z/nZ, and this is the starting point for the problem above, too. Actually to say 
that Z/nZ has property (P) represents a rewording of the fact that if an infinite 
arithmetic progression of integers contains a power with exponent m, and a 
power with exponent n, then it also has a term that is a power with exponent 
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18. 


[m,n]. This is a known folklore problem that appeared a few times in various 
mathematical contests. A particular case was the subject of problem 11182 
from The American Mathematical Monthly from November 2005 (proposed 
by Shahin Amrahov). 


Let p be a prime number that does not divide the determinant of A (which is 
nonzero). In the group G of invertible matrices with entries in the field Z/pZ, 
the equation xe = A is also valid for every positive integer k (we don’t use 
different notations for the respective reduced modulo p matrices). In particular, 
we have x = A, but every matrix from G has the |G|th power the identity 
matrix, according to Lagrange’s theorem. Thus A = Z in this group, meaning 
that all elements of A — J (this time regarded as an integer matrix) are divisible 
by p. Since this happens for infinitely many primes p, it follows that A —/ is the 
zero matrix; thus A = J, the identity matrix, as claimed. 

This is problem 11401, proposed by Marius Cavachi in The American 
Mathematical Monthly 10/2008 and solved by Microsoft Research Problem 
Group in the same Monthly, 10/2010. 


Chapter 7 
Density 


We say that a set A C R is dense in R if any open interval of real numbers contains 
elements from A. One of the practicalities of dense sets follows from the fact that 
two continuous functions from R to R are equal if they are equal on a dense subset 
of R. We have the following characterization of dense sets: 


Theorem. Let A C R. The following assertions are equivalent: 


a) A is dense in R. 
b) for every real number x, there exists a sequence of elements of A, converging 
to x. 


Proof. If A is dense, then for every real number x and every positive integer n, 
we have 


1 1 
(x- 2x4 *) NAF DG. 
n n 


Therefore, for each positive integer n, we can choose an element 


1 1 
Q,E(x-—-,x+-]NA. 
n n 


We obtained a sequence (an)n>1 of elements of A, which clearly converges to x. For 
the converse, let (a, b) C R and let x € (a, b) be fixed. Then there exists a sequence 
(@n)n>1 With elements in A, converging to x. But (a,b) is a neighborhood of the 
limit x; thus we can find an element an, € (a,b). Because an € A, it follows that 
(a,b) N A Æ Ø. In conclusion, A is dense in R. C 


The set Q of rational numbers is dense in R. We will prove that each nonempty 
interval contains at least one rational number. Indeed, let (a, b) be an interval, with 
a < b. Looking for two integers m,n, say n positive, so that a < = < b, we 
observe that it is enough to prove the existence of a positive integer n such that 
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nb — ba > 1. Indeed, if such an integer exists, the interval (na,nb) has length 
greater than 1, so it contains an integer m. Then ¥ € (a,b). But it is clear that 
n=1+ [+] is a solution. Moreover, one can easily prove that any nonempty 
interval contains infinitely many rational numbers. The inequality x — 1 < [nx] <x 

[nx] 


shows that lim => = x. In fact, this is another proof that Q is dense in R. Indeed, 
noo 


for every x € R, the sequence of rational numbers (2) converges to x. Now, 
n>1 


our assertion follows from the theorem. For instance, the sequence 


BABA BAL EA, 


is a nice example of a sequence of rational numbers which converges to 2. 
Further, the set R \ Q of irrational numbers is dense in R. Indeed, let (a, b) 


be a nonempty interval. As we have proved, the interval (+. +) contains at 


least one rational number, say q. Then gV2 € (a,b) and qV2 is irrational. So 
we proved that there are no intervals consisting only of rational numbers. Observe 
that this is also immediate by a cardinality argument: we have seen that any interval 
is uncountable, so it cannot be included in the set of rational numbers, which is 
countable. Finally, here is one more possible argument: we assert that each interval 
of the form (0, ¢), € > 0, contains at least one irrational number. Indeed, we have 
the implication: 


v2.94 (0,2), 


for all positive integers n. If q is rational, in (a,b), then let w be irrational in 
(0, b — q). Now, the number w + q is irrational and w + q € (q, b) C (a,b). 

Finally, note that the notion of density has nothing to do with the order structure 
of R—it is a topological notion. It can easily be extended to other spaces, such 
as R”. Indeed, if ||x|| is the Euclidean norm of the vector x € R”, then we say that 
a set A is dense if every open ball B(x, r) (defined as the set of points y € R” such 
that ||y — x|| < r) contains at least one element of A. All properties of continuous 
functions related to dense sets that we discussed remain clearly true in such a larger 
context. 

A very useful and general result is the following: 


Theorem (Stone-Weierstrass). Let K be a compact subset of the set of real 
numbers and let A be an algebra of continuous functions defined on K and having 
real values. Suppose that all constant functions belong to A and also that for any 
distinct points x,y € K, there exists f € A such that f(x) 4 f(y). Then A is dense in 
the set of continuous functions on K, with real values, for the uniform convergence 
norm. That is, for any continuous function f defined on K, with real values and for 
any € > 0, there exists g € A such that |f (x) — g(x)| < € for all x € K. 
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The proof of this result is far from being easy and will require several steps. 
First, observe that if we define A to be the closure of A in (C(K), || - |loo) Gi.e., the 
set of continuous functions on K with ||f||oo = max [f(x)|), then A is also an algebra 

xE 


having the same properties as A. Hence we can directly assume that A is closed. 
Now, let us show that with this assumption, if f € A, then |f| € A. By working with 


f instead of f, we may assume that ||f||oo < 1. Write 
IIflloo 
Co 
~ {1/2 p 
f| = vreD- je-m 
n=0 4 
where 


(A T 1 e 


n 


This is just a consequence of the Taylor series of (1 + x)”. On the other hand, using 
Stirling’s formula, we obtain that 
1/2 
n 


and since ||f? — 1|loo < 1, it follows that |f| can be approximated with polynomials 
in f to any degree of accuracy: simply take the partial sums of the above series. 
Since these polynomials in f belong to A = A, so does |f|. This finishes the proof of 
the first step. 
This shows that min(f, g) € A and max(f, g) € A if f, g € A. Indeed, it is enough 
ie Sei 
2 


< œ, 


to note that min(f, g) = 8 and the hypothesis made on A shows that 


min(f, g) € A. The next step will be to prove that for all x Æ y € K and for all real 
numbers a, b, there exists f € A such that f(x) = a and f(y) = b. This can be done 
by taking 


= _ V8 = 8) 
f@=a+(b CRE 


where g € A is such that g(y) # g(x). Then f € A (because A is stable 
under multiplication, addition, and contains the constant functions) and clearly 
f(x)=a, f(y) = b. 

Consider x,y € K and a continuous function f defined on K. We know (by the 


previous step) that there exists gy € A such that g, (x) = f(x) and gx») = fO). 
Let € > 0 be fixed and also fix a point x € K. By continuity of f — 8x at y, for 
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all y € K, there exists an open interval 7, centered at y and such that g(t) < 
f(t) + € for all t € I. Because the intervals 7, are an open cover of the compact 
set K, there exist y},y2,..., Yn € K such that K is covered by J,,,J,,,...,4,,. The 
function gy = min(gxy,,8xy.,---+&xy,) belongs to A (recall the first step of the 
proof), g(x) = f(x), and also, g(t) — f(t) < € for all t € K. By continuity at x, for 
all x € K, there exists an open interval Jx centered at x such that f(t) < g,(t) + € for 
all t € Jy. Using again the compactness of K, we can extract from the open cover 
(Jy)xex a finite cover Jy, ,Jy,,...,J,. Then the function g = max(gx,, 8x),--- , &x,) 
belongs to A and satisfies |g(x) — f(x)| < € for all x € K. This finishes the proof of 
the theorem. O 

Here is an application, which can hardly be proved by elementary arguments. 


Problem. Let f : [0,1] — R be a continuous function such that for all 
nonnegative integers n, 


1 
f fœx"dx = 0. 
0 


Prove that f = 0. 


1 
Solution. We deduce that / f(x)P(x)dx = 0 for all polynomials P with real 


coefficients. Now, let € > 0 sad let P be a polynomial such that |P(x) — f(x)| < € 
for all x € [0, 1]. Such a polynomial exists by the Stone-Weierstrass theorem applied 
to the algebra of polynomial functions on [0, 1]. Let M be such that |f(x)| < M for 
all x € [0, 1] (it exists by the continuity of f). Then 


1 1 
f fax = f Fl (f(x) — P@))dx < Me. 
0 0 


1 
Because this is true for all €, it follows that f f(x)’dx = 0; hence f = 0, due to 
0 


the continuity. O 


We present now a quite challenging problem, from Gazeta Matematica’s contest: 


Problem. Find all continuous functions f : R — R such that 


f(2x—y) + f(2y — x) + 2f(« + y) = 9f (x) + fy) for all x, y. 


Solution. We clearly have f(0) = 0. By taking x = y, we deduce that 
f (2x) = 8f(x). Also, with y = 0, we deduce that f(—x) = —f (x). Finally, by taking 
y = 2x we deduce that f(3x) = 27f(x). Using these results, we can immediately 
prove that f(x) = 2x°f(1) for all x € A, where A = {2"°3"|m,n € Z}. We claim 
that A is dense in [0, 00). This follows if we prove that Z + Z(log, 3) is dense in R. 
This is clearly a noncyclic additive subgroup of R. The conclusion follows from the 
following general and useful result: 
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Lemma. Any additive subgroup of the additive group of real numbers is either 
cyclic or dense. 


Indeed, let G be such a group and suppose it is not the trivial group. Let œ be 
the greatest lower bound of the set X = G N (0,00). Suppose that G is not cyclic. 
We claim that œ € G. Otherwise, there exists a decreasing sequence x, of positive 
elements of G that converges to a. Then x,—1 — Xn is a positive element of G that 
converges to 0; thus a = 0 € G, which is a contradiction. Therefore, a € G. Now, 
assume that a # 0. If x € Gand x > 0, then y = x—a@- [7] € Ganda > y > 0. 
Thus y = 0 and so any element of G is an integer multiple of œ, also a contradiction. 
This shows that œ = 0 and there exists a sequence x, € G that converges to 0, with 
Xn > 0. Therefore, if (a, b) is any interval and c € (a, b), for sufficiently large n, we 
have [=n € (a,b) N Gand so G is dense. O 


Coming back to the problem, we deduce that f(x) = xf (1) on a dense set; 
therefore f(x) = x*f(1) everywhere. Thus, any such function is of the form f(x) = 
ax? for some a. It is not difficult to check that any such function satisfies the given 
relation, so these are all solutions of the problem. O 

We said in the beginning of the chapter (and we saw in the previous problem) that 
dense sets are practical when dealing with continuous functions. The next problem, 
taken from the 2000 Putnam Competition highlights this assertion: 


Problem. Let f be a continuous real function such that 
f (2x? — 1) = 2af (x) 


for all real numbers x. Prove that f(x) = 0 for all x € [-1, 1]. 


Solution. Let F(x) = F(cos x) 
sin x 


relation implies that F(2x) = F(x). But F is clearly 27 periodic. Hence 


, defined and continuous except on Zz. The given 


F(1) = F(2*!) = F! 4 207) = F (1 +4 =) 


nr 
for all integers n, k. It is however easy to see that the set (SE |n,k € z} is dense 


|2. =| T 
z Ma 


This shows that F is constant on each interval of the form (kz, (k + 1)z). Change 
x and —x in the given relation and conclude that f is odd; thus F(x) = F(x + 7x). 
This, combined with the previous result shows that F is constant on R \ Zz and this 
constant is 0, because F is odd. Thus f(x) = 0 for x € (—1, 1) and by continuity, 
this also holds for x € {-1,1}.0 


in R, since for instance 


lim =a, forall « € R. 
noo 
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The next problem, given on the Putnam Competition in 1989, uses the density of 
the rational numbers in a very unexpected way: 


Problem. Does there exist an uncountable set of subsets of N such that any two 
distinct subsets have finite intersection? 


Solution. Surprisingly, the answer is yes and the construction is not intricate. 
Consider for each x € (0, 1) a sequence (X,)n>1 of rationals converging to x. 

Let A, be the set consisting of the numbers x,. Because the limit of a convergent 
sequence is unique, Ay N Ay is finite if x A y. Also, we have uncountably many 
sets Ay, because (0, 1) is uncountable. Now, it is enough to consider a bijection 
f : Q — N (an earlier chapter shows that f exists) and to consider the sets f(A). 
They satisfy the conditions of the problem. O 


Proposed Problems 


1. Prove that the set A = (Z |mEZ, ne N} is dense in R. 


2. Let (ad,)nen be a sequence of nonzero real numbers, converging to zero. Prove 
that the set A = {ma, |m € Z , n € N} is dense in R. 
3. Prove that the following sets are dense in R: 


a) A= |p+qv2|p.qe Zh; 
b) B= laV4+ bV2+clabceZh. 


4. Prove that the set A = io = on | m,n € Nọ is dense in R. 

. Prove that the set A = {msinn | m,n € Z} is dense in R. 

6. Let x bea real number. Prove that the set of all numbers of the form {nx}, n € N, 
is dense in [0, 1] if and only if x is irrational (here, {-} denotes the fractional part 
function). 


7. Prove that the following sets are dense in [1, oo): 


a) A={w/n|mneN, m>2}; 


1 1 1 
fit pt tte ti lmnen, me? 
n 


Nn 


b) A= 


3 


8. Let (an)nen be a sequence of real numbers with limit ++-oo, such that 
lim (an41 — an) = 0. 
noo 
Prove that the set A = {am — an | m,n € N} is dense in R. 


9. Is /2 the limit of a sequence of numbers of the form 3/m — ¥/n, with positive 
integers m,n? 
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10. Is 3 the limit of a sequence of numbers of the form 


with positive integers m > n? 
11. Letf,g:.R— R, f continuous, g monotone. Prove that if 


F(x) = 80), 


for all rational numbers x, then f = g. 
12. Letf : R — R be a continuous function such that 


fo) = (x42), 


for all reals x and for all nonnegative integers n. Prove that f is constant. 
13. Letf : R — R be a continuous function such that 


f(x) =f(x + V2) = f(x + V3), 


for all real numbers x. Prove that f is constant. 
14. Define a function f : N > [0, 2), by 


fn) = {Vn} + {vn + 1}. 


Prove that Imf is dense in [0, 2]. 
m+n i oe 
15. Let M be the set of real numbers of the form a with m,n positive 
m+n 
integers. Prove that if u,v € M, u < v, then there exists w € M so that u < 


w <v. 

16. Let (an)nen be a sequence decreasing to zero, and let (b,)nen be a sequence 
increasing to infinity, such that the sequence (b,+1 — bn)nen is bounded. Prove 
that the set A = {ambn | m,n € N} is dense in [0, co). 

17. Let (bn)nen be a sequence increasing to infinity, such that 


bn 
lim Z} =1. 


noo bn 


Prove that the set 


is dense in [0, 00). 
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18. 
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Let (an)„>ı C (1, 00) be a sequence convergent to 1, such that 
lim (ajaz +- an) = œ. 
n—>Co 


Prove that the set A = {an+14n+2*** am | n,m € N, n < m} is dense in [1, 00). 


19. Are there positive integers m > n such that 
: + f Fens f 
net. vm 1 1 
Ua LE M ef yD- +. 007+ —)? 
1 1 2007 2007 
lhea e 
V2 vn 
20. Is there a dense set of the space that does not contain four coplanar points? 
Solutions 
1. For each real a, we have 


 [2"a] 
lim = 


n>oo 2n 


Indeed, using the inequality x — 1 < [x] < x, we derive 


2"a—-1 — [2"a] 
eg 


Qn m 7 
or 

1 [2a] 

2n 2n E 


. . n 
In conclusion, for every real number a, there exists the sequence (5) 
n>1 


with elements from A, which converges to œ. Hence A is dense in R. 

Otherwise, let (a, b) be a nonempty interval of real numbers. The question 
is whether there exists an element of A lying in (a, b) . In other words, can we 
find integers m,n, n > 1 so that 


m 
a<— <b? 
Qn 
This inequality is equivalent to 


2"a<m < 2"b, 
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which we now prove. Indeed, first let us choose a positive integer n > 


log, zo) Thus 


1 
2” > — > 2"b—-2"a> 1. 
b—a 


Now the interval (2”a, 2”b) contains at least one integer, because its length is 
greater than 1. Finally, if m is that integer, then 


m m 
Za<m<2"b=a< 7 <b= 5 € (a,b). 


2. Let us consider real numbers a < b and prove that the interval (a, b) contains 
at least one element of A. Because lim — 
noo jan] 

b— 


a . a 

> 1. The length of the interval (<. — 
Jan | ‘ v lang | lang | 
contains an integer mo : 


= oo, we can find no such that 


) is greater than 1, so it 


< mgo < —. 
lang | 


[ano | 


This means that 


a < Mo|dn.| < b, 


1 
so A is dense in R. In particular, for an = zn’ we deduce that the set 


(Z lmeZ. nen} 


is dense in R. 
3. a) The sequence a, = (/2 — 1)” converges to zero; therefore the set 


{m-(V2-1)"|meZ, nen} 


is dense in R, by the previous exercise. Now, there are integers an, bn such 
that 


(V2 = 1)" = a,V2 + dy. 


Indeed, using the binomial theorem, we have 


Wa- Pen (7) wa 


k=0 
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Hence 
m(Vv2— 1)’ = man 2 + mby. 
Because 
{pv2+alp.qe Zh 5 {m(V2 1)" |meZ, nen}, 


the conclusion follows. Note that we could also use the fact that A is 
an additive subgroup of R, which is not cyclic because /2 g Q. The 
conclusion follows from a result discussed in the theoretical part of the text. 


b) The sequence a, = (4/2 — 1)” converges to zero. Therefore the set 


wm 


{m-(/2—1)"|meZ, nent 
is dense in R. Now there are integers an, bn, Cn such that 
(2 = 1)" =a V4 +b, Y2 + Cp. 


Indeed, using the binomial theorem, we have 
3 n 1k (2 3/5\n—k 
CA = Den (p) ca 
Hence 
m(V2 — 1)" = ma, V4 + mb, V2 + men. 
Because 
fai + bv2 +c a,b,c eZ} D> {n(V/2— 1)" | me Z, nen}, 


the conclusion follows. 
4. If x € A, then —x € A. Fora > 0, note that 


ji [2"a] n 2 
A h on aa) A 


Thus for every real number œ > 0, the sequence 


[2"a] n 
=i aa "I 


n 


of elements from A converges to œ. Since 0 € A, we are done. 
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5. We prove that for each £ > 0, there is a positive integer t such that 
|t — 2kr| < e, 


for some integer k. Indeed, let us consider a positive integer N with 1/N < e. 
Two of the N + 1 numbers {2s} (where {x} denotes the fractional part of the 
real number x), 1 < s < N + 1, must be in the same interval of the form 


If these numbers are {2p7} and {2q7}, with 1 < p < q < N + 1, then the 
absolute value of their difference is less than the length 1/N of each interval: 


2p} — gn} < $ < e. 


Thus we have |t — 2kr| < e for either t = [2pm] — [2q7] and k = p — q, or 
t = [2q7] — [2pr] and k = q — p (depending on which of these choices gives a 
positive f). 

Note that actually there are infinitely many pairs (t, k) with positive integer t 
and integer k such that |t—2kx | < £ (because we can repeat the above reasoning 
for N bigger and bigger) and that in these pairs ¢ cannot assume only finitely 
many values (we encourage the reader to think of the complete proof of these 
statements); hence we can find such pairs (t, k) (having all the above properties) 
with ¢ as large as we want. 

Now we can find such a pair (t1, kı) for € = 1, that is, with the property that 


|ti — 2k, r| <l. 


We then consider ¢ = 1/2, and we pick a pair (t2, k2), with h > tı, and such 
that 


1 
|t2 — 2kr | < 7 
In general, if (tj, kj) were found such that t1 < t2 < +-+ < tn and 
|t; — 2k; | < ! 
j j j 


for every 1 < j < n, we can pick a pair (tn+1,kn+1) of integers (t,4) being 
positive) such that 


1 
tnt — 2k»41a| < —— 
+1 +17 nel 
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and t,41 > tn. Thus there exists a sequence (t,),>1 of positive integers such that 
1 
[tn — 2ky| < =, 
n 
with integers k„. Consequently, 
lim sint, = 0 
noo 
and finally, the set 
{msint, |meéeZ, ne N*} 


is dense in R, by exercise 2. The conclusion follows because this is a subset of 
the set {msinn | m,n € Z}. 


. Let us suppose that x is rational, x = = where p, q are relatively prime integers. 
q 


In this case, 


p—1 


{{nx} |ne N} = 40, ere 
q 


Q le 


so this set cannot be dense in [0, 1]. 

So assume that x is an irrational number. We need to prove that the set of 
fractional parts of the numbers nx with positive integer n is dense in [0, 1]. We 
first show that if u = inf{{nx} |n € N*}, then u = 0. 

Assume, to get a contradiction, that u > 0; thus we can find some positive 
integer N such that 


eee 
i te 
Wea. ae 


We conclude that u < (u + 1)/(N + 1) < 1/N; hence there exists a positive 
integer k such that 


w+) 1 


<u <fkxt< < —. 
Noi 24 Sth <a <N 


It follows that 1 < (N + 1){kx} < 1 + 1/N; therefore the integral part of 
(N + 1){kx} is 1. Thus we have 
{(N + 1)kx} = {(N + 1)kx— (N + 1)[kx]} 
= {N + Ik} 
= (N + Ik — [(N + Di] 
= (N+ Ik} -—1 < u 
= inf{{nx} |n € N*}, 
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which is clearly a contradiction. Note that this part of the proof does not require 
the assumption that x is irrational (and, indeed, we have already seen that u = 0 
in the case of rational x). 

Now let 0 < a < b < 1 and consider a positive integer p such that p is small 
enough in order that (b — a)/{px} > 1. (Here is where we use the irrationality 
of x, which implies that {nx} is never 0; hence we can find p with not only 
{px} < b — a but also with {px} > 0.) Then, having length greater than 1, the 
interval from a/{px} to b/{px} contains at least one integer m, for which we 
have 


0<a<m{px}<b<1. 


Thus 0 < m{px} < 1; hence m is positive and {mpx} = m{px} is between a 
and b, finishing our proof. (Alternatively, one can proceed as in problem 2, by 
considering a sequence of positive fractional parts {nx} tending to 0, etc.) 

This is actually a celebrated theorem of Kronecker (1884). Note that it also 
implies the (already proved) result saying that if x is irrational, then Z + xZ is 
dense in R. In fact, we can prove that, for a positive irrational number x, the set 
{mx—n |m,n € N*} is dense in R. Indeed, let us consider arbitrary real numbers 
a and b, with a < b. Assume first that 0 < a < b, and choose a positive integer 
p > b; we then have 0 < a/p < b/p < 1, and Kronecker’s theorem implies the 
existence of some positive integer g with the property that a/p < {qx} < b/p, 
so that a < pqx — p[qx] < b, that is mx — n € (a,b) for the positive integers 
m = pq, and n = p[qx] (n is positive because x > 0 and q can be chosen as big 
as we want, for instance q > 1/x). Now if a and b are arbitrary real numbers 
with a < b, there is a positive integer k such that O < a + k < b + k; thus 
(according to what we just proved) there are positive integers m and n such that 
a+ k< mx-—n < b + k; thus mx — (n + k) is in (a, b), which we had to prove. 

7. Let 1 < a < b. Because 


lim (b” — a”) = œ, 
m—>CoO 
we can find mọ such that 
be —a™ > 1. 
Further, the interval (a’”, b”°) contains at least one integer, 
no € (a, b™) N Z; 


thus 


am < ngo < b™ Sa< "Yng <b. 
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Moreover, because a” > 1, we can denote by k the smallest integer so that 


1 mM 
I+ 5t+-+-+7>a™ 
k 
But 
bD™ >a™ +1, 
so 
amciplp ge Se 
2 k 
or 


a< te deea ei 
2 k 


8. Leta,b € R, a < b. Let no be so that m > no implies 
Am+1 — âm < b — a. 
We have 
(E (n ~ dy) = 00, 
so let mọ > no + 1 be the smallest integer with the property that 
Amo — Ang > 4. 
Therefore dmo—1 — Any < a and 
Amo — Any = (Am — Amp—1) + (Ginp—1 — Ang) < (b — a) +a =b. 


In conclusion, a < Amy — Ang < b. 
9. The answer is yes. The sequence 


an= 34n, n>1 
has limit oo and 
lim (an+1ı — an) = 0. 
n—>00 
Indeed, 
Jim, (nts — 40) = Jim, (Ya 1 - Sa) 


1 
Resin 3 2 3 3/72 = 
Vin+1?24+ Ynnt+ 1+ Vn 
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As we have proved, the set 
A = {am — a, | m,n € N} = {/m— s/n | m,n € N} 


is dense in R. 
10. The answer is yes. The sequence 


1 1 
An = DE oe ker n>1 
is increasing, unbounded and 
lim (an+1ı — an) = 0. 
noo 


For all m > n, we have 


1 1 
Am n = oe + — 
n+1 m 
As we have proved, the set 
1 1 
A = {dm — a, | m,n E N} = | —— +---+—|mneN,m>n 
n+1 m 


is dense in Ry. 
11. For each real number x, consider two sequences of rational numbers (an)n>1 
and (b,)n>1 converging to x such that 


An S an+ SXI bayi = bn, 


for all positive integers n. The function g is monotone, say increasing; thus the 
sequence (g(an))n>1 is increasing and bounded above: 


glan) < g(x). 
Let l = hm g(a,). On the other hand, 
8(4n) = f Gn) > FO), 
because of continuity of f at x; thus / = f(x). Analogously, we deduce 
lim g(an) = lim g(bn) = f(x). 
noo noo 
By taking n — oo in the relations 


8(Gn) <ga)< g(bn), 
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13. 
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we derive 
f(x) < gx) <f, 


sof = g. 
We have 


2 1 


2 
and by replacing x with x — —, 
n 
1 2 
Mae) ay 
n n 


fa) =f (x+—) 


By induction, 


for all integers m,n 4 0. For x = 0, 
m 
f (=) = £0). 


for all m € Z, n € N*. The continuous function f is equal to the constant 
function f (0) on Q, so 


fœ = f0), 


for all reals x. 
There is also another argument: we proved that a continuous, periodic, 
nonconstant function has a minimum period. It is not this case, when f has 


1 
periods of the form —, for every positive integer n. 
n 


First, with x > x + /2 and so on, we obtain 

f(s) =f@et V2) =f(«+2Vv2) =..., 
and with x > x + /3 and so on, we obtain 

fx) =f + V3) = f+ 2V3) =... 
so 


f(x) = f(x + mV2) = f(x + nV3), 
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for all integers m,n. Moreover, 
fat mV2 + nvV3) = f(x + mV2) = fA) 
and for x = 0, 
f (mv2 + nv3) = f(0), 
for all m,n € Z. It follows that f is constant because the set 
{mV/2 + nvV3 | m,n € Z} 


is dense in R (using the same argument as in exercise 3a). 
14. Let 0 < a < b < 2. For each positive integer k, with 


1 b? — a? 
2= f 
b-a 4 
the length of the interval 


(6 oe (x+ Js ) 


is greater than 1, so at least one integer n lies in this interval. For such an n, we 


k> 


have 
pee): eee) 
+5) < sikta — 
( 2 n<( 5) 
a b 
kts <Vn<vnt1<k+;5. 
Hence 
5 < iva} <5 
and 
c<] +1} h 
ee n <. 
2 2 


In conclusion, 
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15. The function 


f:[low)NQ-R, 
given by the formula 


x+1 
S rar 


is decreasing. The set M is the image of the function f. Let u, v € M, u < v be 
of the form 
mı +n m +m 
u= —_, = —, 
yim +m yma + 13 


with mı > nı and m > m. We can write 


my, m 
— +1 — +1 
n n 
u = : , v= 
2 2 
m 
— +1 +1 
ni ny 
or 


Hence from 


mı mM 
m o mtm m 
mMm 2 ~ ny 
we deduce that 
mı m2 
pa a 


lies between u and v, because of the monotony of f. 


16. Assume that b +1 — bn < M. Let 0 < a < b. Choose an integer m such that 


b-a 
M 


Gm < 
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We can find an integer n such that 


a b 
— < b, < —, 
am Am 
because the situation 
a b 
bg < — < — < bk+1 
am am 


is not possible. Indeed, this would imply 


-a 
bey — bk > 


>M, 
am 


which is a contradiction. In conclusion, a < ambn < b. 
17. With the notation a, = Inb,, we have lima, = oo and 


bn 
lim (@n41 — an) = lim In Z= = 0. 
noo n—>Co n 
As we have proved, the set 
{dm — an| m,n € N} 
is dense in R, so the set 
bin | N 
— |mne 
bn 
is dense in [0, 00). 
18. Let us define the sequence (b,),., by the formula 
by = A, a2++* Ay. 
We have 
bn -anan 
+1 _ 4102 AnAn+1 ay ee a 


bn a1a2**: An 
so the sequence (b,,),,5 is increasing. Then 


Ont : 
lim == = lim an+1 = l; 
n>œ bp n—>oo 
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hence we have the hypothesis of the previous problem. Thus the set 
bm 
A={—|mnEeN,n>l1l 


is dense in [0, oo) and the conclusion follows if we take into account that 


b didz m 


m 
a N, = Gn414n4+2°** Am. 
bn aatan 


19. The answer is yes. We will use problem 17 with the sequence 


1 1 
bi =1+ — +- +—, n>l. 


v2 Vn 


As we know, the sequence (b,),,, is unbounded and is clearly increasing. 
Moreover, 


Dn+1 
so 


converges to 1. Hence the set 
n 


n 


Din 
A=} ImneN, mn> I] 


is dense in [0, co). But for m > n, we have 


1 1 1 1 1 
1+ — ++ + + +e + — 
ml V2 A (5 Jn +2 aa) 
b, 1 1 
n 1+ +e + — 
v2 vn 
: + : ++ : 
ju Natl n+2 m 


Finally, if m > n are so that 


bin 1 1 
m e (1+ 72007- ——, 1+ v2007+— ), 
b, ( t 2007 T T ay) 
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then by subtracting unity, we obtain 


1 1 1 
+ pee 
Tel OED 1 1 
AEL ONIE? vm e (J-V + _). 
Peet hae ROE 2007 2007 


V2 vn 


20. Surprisingly, the answer is yes. Indeed, take a, to be the sequence of points in 
space with all coordinates rational numbers and consider the balls B, centered 
at a, and having radius L, It is clear that any sequence (xn)n>1 with xn € Bn 
for any n > 1 defines a dense subset of the space. But since a ball cannot be 
covered by a finite number of planes, by induction, we can construct a sequence 
Xn € B, such that x, is not on any plane determined by three of the points 


X1,X2,...,X,-1. This shows the existence of such a set. 


Chapter 8 
The Nested Intervals Theorem 


It is well known that every monotone sequence of real numbers has a limit, finite or 
infinite. If (%,)n>1 is Increasing, then 


lim x, = Sup xn 
noo n>1 


(where it is possible for the supremum to be oo) and if (%,)n>1 is decreasing, then 


lim x, = inf x, 
noo n>1 

(where the infimum can be —oo). The monotone convergence theorem gives further 
information, namely, that every bounded monotone sequence of real numbers is 
convergent, i.e., it surely has a finite limit. The proof is based on the least upper 
bound axiom, asserting that any nonempty bounded above set of real numbers has 
a least upper bound. (Actually this statement is equivalent to the nested intervals 
theorem that follows, and each of them expresses the completeness of the system of 
real numbers.) 

As a direct consequence of the monotone convergence theorem, we give the 
following result. We note that if Z = [a,b] is a closed interval, then I(T) = b — a 
denotes the length of 7. 


The Nested Intervals Theorem. Let 
QDh2:::2Dh2::: 


be a decreasing sequence of closed intervals. Then 


(\in #9. 


n>1 
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Moreover, N I, is a singleton in case lim I(I,) = 0. 
n—>00 


n>1 


Proof. IfI, = [an, bn], n > 1, then from the inclusions 
[a1, bı] 2 [a2, b2] 2 -++ 2 [an, bn] 2 ++ 
we deduce the inequalities 


a Sa LeS Aan Si S ba So < b < b. 


Hence (a,),>1 is increasing and (b,),>1 is decreasing. Let us denote 


a= lima, b= lim dy. 
noo noo 


From a, < bn, it follows a < b. Further, we will prove that (i = [a,b]. If 
n>1 
zE [Nh then z € In > a, < z < bn for all integers n > 1. By taking n —> oo in 
n>1 

the last inequality, we derive a < z < b, thus z € [a, b]. 

Conversely, if z € [a,b], then a, < a < z < b < ba soa, < z < bn, for all 
integers n > 1 and then z € N Ih. 

n>1 
For the second part, we have 


lim (bn —a,) = 0, 
noo 
NYO) 


lim a, = lim b, = a. 
noo noo 


Finally, ( ) J, = {a}. Indeed, if x € (`) Z, then a, < x < b, for all n > 1, and for 
n=1 n>1 
n — oo, we deduce a < x < a, which is x = a. O 


This theorem is an important tool to establish some basic results in mathematical 
analysis, as we can see below. 


Problem. Let f : [a,b] — R be a continuous function. If f(a)f(b) < 0, prove 
that there exists c € [a, b] such that f(c) = 0. 


Solution. Let us consider the decomposition 


la, b] = E ad U | 
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b b 
and denote by J; = [a;, bı] one of the intervals E us | or É > ; J such that 


2 
f(a)f (bı) < 9. 
If the 


i= lax, bx], 1<k<n-l 


are already defined, then let Z, = [a,, bn] be one of the intervals 


an—1 + bn-1 an—1 + Dn-1 
an-1) a or a Dn-1 


such that f(an)f (bn) < 0. Inductively, we defined the decreasing sequence (J,)n>1 
—a 


2 n 


f(an)f (bn) < 9, (8.1) 


of closed intervals with /(/,) = and 


for alln > 1. Let {c} = [Nh As we proved, a, — c and b, — c. If we take 
n>1 
n — oo in (8.1), we get that (f(c))* < 0, because of the continuity of f. 
In conclusion, f(c) = 0. O 


In fact, this is basically a proof of the following: 


Theorem (Cauchy-Bolzano). Let I C R be an interval. Then any continuous 
function f : I —> R has the intermediate value property. 


Proof. We have to show that for any a,b € J and for each À between f(a) and 
f(b), we can find c between a and b such that f(c) = A. 

We assume that a < b, without loss of generality. The function g(x) = f(x) — À 
is continuous on [a, b], and, from the bounds on A, g(a)g(b) < 0. Consequently, 
g(c) = 0 for some c € [a, b]. 0 


Cauchy-Bolzano’s theorem has many other interesting proofs. We present here a 
proof of this theorem using the compactness of [a, b]. 

Assume, by way of contradiction, that the continuous function g : [a,b] > R 
takes the values g(a) and g(b) of opposite signs and g(x) Æ 0, for all x in [a, b]. For 
each x € [a, b], we have g(x) Æ 0, so there is an open interval Z, > x such that g has 
the same sign as g(x) on J, N [a, b]. 

The family (J,)xe{a,4) is an open cover for the compact set [a, b]. Consequently, 
there are 


X1, X2... Xn € [a,b] 


such that 


iy Nla 0G, D [a,b]. 
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Assume that 
X1 < X< << Xp, 
so the intervals 


y E ETE E 
are denoted from left to right. In particular, a € Iy, and b € I,,. 

The function g keeps the same sign on J,, N [a, b] and [,, N [a, b]. 

Because I, N Ian # ø, g will keep the same sign on (/;, UJ,,) N [a,b]. 
Inductively, g keeps the same sign on 


(La UL, U+ UE) A fa, b], 
for each 1 < k < n. In particular, g keeps the same sign on 
(Li U Io U -+ U h) O [a, b] = [a, b], 
which contradicts the fact that g(a) and g(b) have opposite signs. 
Moreover, we know that continuous functions transform connected sets into 
connected sets. The interval [a,b] is connected, so g([a,b]) is connected. This 


remark can be another proof of the Cauchy-Bolzano theorem. L 
The next application is the following fixed point result: 


Theorem (Knaster). Any increasing function f : [a, b] —> [a,b] has at least one 
fixed point. 


Proof. We have f(a) > a and f(b) < b. If 


(F) 


2 2 


a+b 
then put a; = a,b, = S In case 


a+b a+b 
> : 
i ) 2 


b 
we put a; = 2. bı = b. Either way, for the interval J; = [a,, b1], we have 


f(la)>a, f(b) <b. 
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By induction, we can define the decreasing sequence J, = [an, bn], n > 1 of closed 
intervals with 


b 


a 


In) = 


7 
and 
flan) Zan, f (bn) < bn, (8.2) 
foralln > 1. Let c be the common limit of (an)n>1 and (bn)n>1. If we take n —> œo in 
an < f (an) < f (bn) < bn, 
we derive 
lim f(a,) = lim f(b,) = c. 
nod nod 
Further, by the monotonicity of f and from the inequalities 
an SC < bn, 
we deduce 


flan) < f(c) < f (bn). 


Finally, for n + oo, we have c < f(c) < c, that is, f(c) = c. O 


Let us try to extend the lemma of the nested intervals to higher dimensions. Let 
us recall first some basic facts about the topology of R”. By definition, a set A C R” 
is open if for all x € A, there exists r > 0 such that B(x, r) C A. Here, B(x, r) is the 
open ball centered at x and having radius r, that is, the set 


tye R"| |x= yll < rj, 


where ||x|| = /x} +---+ x2 is the Euclidean norm. A is called closed if R” \ A 
is open. The interior of a set A is the largest open set contained in A. If Int(A) 
denotes the interior of A, then x € Int(A) if and only if there exists r > 0 such that 
B(x,r) C A. Finally, if A is bounded, the diameter of A is defined by 


diam(A) = sup ||x— y]l. 
x,yEA 


A famous theorem asserts that for a subset A of R”, the following statements are 
equivalent: 
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1) A is compact, that is if A C U O;, where O; are open sets and / is an arbitrary 
iel 
set, then there exists a finite subset J of J such that A C U Oi. 
ie] 
2) A is closed and bounded; 
3) Any sequence whose terms are in A has a subsequence converging to an element 
of A. 


We will not prove this theorem here, but will use it to generalize the lemma of the 
nested intervals: 


Theorem. Let (K„)n>ı1 be a decreasing sequence of nonempty compact sets. 
Then (\Kn is nonempty and moreover, if lim diam(K,) = 0, then (\Kn is a 
n—>Co 


n>1 n=1 
singleton. 


Let us briefly present the proof. Choose a sequence x, € K, and observe that 
all its terms lie in the compact set Kı. Thus it has a subsequence (Xp,)i>1 Which 
converges to a certain x € Kı. Observe that x, € Kp for all k > p (because m > 
k > p, thus x, € Kn, C Kp), thus x € Kp, too (because K, is closed). Because p 
was arbitrary, x € N Kp, and the first part is proved. The second part is obvious: if 

p21 


a Æ b € (` Kn, then 


n>1 


diam(K,) = ||a—bl|, Vn >= 1 


thus passing to the limit, a = b, a contradiction. O 
We end this theoretical part with a very useful result called Baire’s theorem. It has 
a very easy statement, but the consequences are striking. 


Theorem (Baire). 


1) Let (Fn)n>1 be a sequence of closed sets in IR‘, each of them having empty 

interior. Then U F,„ also has empty interior. 

n>1 
2) Let (On)n>1 be a sequence of dense open sets in RË. Then N On is also dense. 
n>1 

Observe that it is enough to prove just the second assertion because the first one 
is obtained by considering O, = R@\ F,,. So, let us fix an open ball B(x, r) and let us 
prove that N O, intersects this ball. Because O is dense in R, it intersects B(x,r) 


n>1 
and because it is open, there is a closed ball B(x, r1) C B(x, r) N O1. Because O3 is 


dense in R? and open, there is r2 < 5 and x such that the closed ball B(x2, r2) C 


r. keea 
B(x1, r1) N O2. Inductively, we construct x, and r,4) < > such that B(Xn+1, +1) C 
B(Xn, Tn) IY On+1- 
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ri 
Now, |lxn+1 — xnl] Sms Qn-1? 


thus X Onyi — Xn) is absolutely convergent, 
n>1 


thus convergent. This implies that x, converges to a certain p € R°. It is immediate 


to see that p € N On | N B(x, r), which finishes the proof of the theorem. O 
n>1 
Actually, using exactly the same idea (except for the argument showing that x, 
converges) one can show that we can replace R? by any complete metric space. Here 
is a beautiful application, for which you can find another solution by studying the 
proposed problems. 


Problem. Prove that there is no function f : R —> R which is continuous exactly 
at the rational numbers. 


Solution. We will prove first that the set of points where a function is continuous 
is a countable intersection of open sets. Indeed, let us define 


o(f.x)=inf sup —[f(a) — f ()| 


r>0 a, be(x—r,x+r) 


1 
and O, = x E R | (f, x) < “|. 
n 
We claim that f is continuous precisely on N On. Indeed, it is pretty clear that f 
n>1 
is continuous at x if and only if w(f,x) = 0 (just use the definition of continuity and 


the triangle inequality). 
Now, we prove that each O, is open. Let x € O, and let r > 0 be such that 


1 
sup (fla) —f)| <= 


a,be(x—r,x+r) 


Clearly, there is ô > 0 such that if |x — y| < 6, then 


( i +5) c¢ +r) 
y z? 7 x—r,x+r), 


thus 


s rosos w Ia) FO) <5 


a,bE€ (v-$.y+$) a,be(x—r,x+r 


1 
and so w(f, y) < —. This shows that (x — 6,x + 6) C On and so O, is open. 
n 
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Finally, suppose that Q = N On, where O, are open sets. Then 


n>1 


R\Q=(JR\0,) = R= Vis uUR\ 0). 


n>1 xEQ n>1 


Because {x} and R \ O, are closed sets, using Baire’s theorem, we deduce that 
either some {x} or some R \ O, has nonempty interior. The first case is clearly 
impossible. The second case would imply that R \ Q has nonempty interior, that is, 
Q is not dense in R, which is again impossible. O 


Proposed Problems 


1. Let (%:)n>1 be a sequence of real numbers such that x„+2 lies between x, and 
Xn+1, for each integer n > 1. If x,41 — Xn — 0, as n > ov, prove that (%,)n>1 
is convergent. 

2. Prove that the interval [0, 1] is not a countable set, using the lemma of the nested 
intervals. 

3. Consider a continuous function g : [a,b] —> R, with g(a) < g(b) andf : 
[a,b] — [g(a), g(b)] increasing. Prove that there exists c € [a,b] such that 
fœ) = ge). 

4. Is there a function f : [0, 1] > R with the property that 


lim [f(2)| = 00, 


for all rational numbers a from [0, 1]? 
5. Let f : [0, 1] —> [0, co) be an integrable function such that f > 0 and 


Í e dx = 0. 


Prove that there is c in [0, 1] with f(c) = 0. Deduce that the set of zeroes of the 
function f is dense in [0, 1]. 

6. Let f, g : [a,b] — R be two functions whose continuity sets are dense in [0, 1]. 
Prove that there exists z in [a,b] such that f and g are both continuous at z. 
(By the continuity set of a function f, denoted Cy we mean the set of all points 
x such that f is continuous at x). 

7. It is well known that the Riemann (or Thomae) function is continuous at every 
irrational point and it is discontinuous at every rational point. 


(a) Does there exist a function s : [0,1] — R which is continuous at every 
rational point from [0, 1] and discontinuous at every irrational point from 
[0, 1] ? 

(b) Is there any continuous function f : R — R that maps the rational numbers 
to irrationals and vice versa? 
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8. 


10. 


(a) Find a function f : R —> R+ with the property: 
any x € Qis a point of strict local minimum of f. (*) 


(b) Find a function f : Q —> R+ with the property that every point is a point of 
strict local minimum and f is unbounded on any set QN/, with Z an interval 
(that is not reduced to one single point). 

(c) Let f : R —> R+ be a function unbounded on any set Q N Z (with J an 
interval). Prove that f does not have the property (*). 


. Let f : R — R be a continuous function with the property that lim f(x) = 0 
n=>00 


for all x € R. Prove that lim f(x) = 0. 
xX 0O 
Let f : [0, 1] — [0, 1] be a continuous function with the property that 


0 € tx SO). SF OO) SF FO), -. 5 


for all x € [0, 1]. Prove that f” is identically 0 for some n. 


11. Prove Lagrange’s mean value theorem by using the nested intervals theorem. 
Solutions 
1. Let J, be the closed interval with extremities at x, and x„+1. The length of the 
interval J, is equal to |x,+1 — Xn| , SO 
lim 1h) = lim |%41 — Xn| = 0. 
noo n—> 00 
From the fact that x„+2 lies between x, and x,+1, it follows that [,41; C I,, so 
the sequence (J;,),>, of closed intervals is decreasing. We can therefore find a 
real æ such that 
Qa 
n>1 
Moreover, the sequence (x,),>) is convergent to œ, as the sequence of the 
extremities of intervals J,, n > 1. 
2. Let us assume, by way of contradiction, that 


[0,1] = {x, |n EN, n> 1}. 


Let J; be one of the intervals 
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for which x, ¢ I. Further, we divide the interval /; into three equal closed 
intervals and denote by J) one of them so that x. ¢ J). In a similar way, we can 
inductively define a decreasing sequence of closed intervals 


KDhDkRD...DI,2D... 


such that x, ¢ In, for all integers n > 1. The length of the intervals tends to zero 
as n — œo, 


1 
(n) = =. 
(i) = 5, 


so all intervals have a unique common point, say 


[h= {eh 


n>1 


Because c € [0, 1], we can find an integer k > 1 so that c = xg. But x, € Ik, so 


Xk ¢ [Nh 


n>1 


which is a contradiction. In conclusion, [0, 1] is not countable. 
3. We have f(a) > g(a) and f(b) < g(b). If 


(=) x (=) 
i 2 ZE 7 ; 


then put 
a+b 
aq=a, b= z` 
In case 
a+b a+b 
>g , 
2 2 
we put 
a+b 
aq; = 7 š bi =b. 


Either way, for the interval J; = [a;, bı] we have 


Pah = gla), FOr) = ath). 
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By induction, we can define the decreasing sequence J, = [an, bn], n > 1 of 
closed intervals with 


b-a 
(L) = 
(m= 
and 
flan) = glan), (bn) < g(bn), (8.2) 
for alln > 1. Let c be the common limit of (an)n>1 and (bn)n>1. If we take 
n—>ooin 
glan) <f(Gn) Sf (bn) < g(bn), 
we derive 


lim f (an) = lim f (bn) =g (c). 


Further, by the monotonicity of f and from the inequalities a, < c < bn, we 
deduce 


(an) < f(c) < f (bn). 


Finally, for n — oo, we have g (c) < f(c) < g(c), which is f(c) = g(c). 
4. The answer is no. Because 


li = 
ale lf (x)| = æ, 


we can find a closed neighborhood 7; of 1/2 for which |f(x)| > 1, for all 
xel,\ {5} . Now, if we take a rational number r; in the interior of J; \ {5} ; 
then from 


lim |f(@)| = œ, 
XK], 
we deduce the existence of a closed neighborhood Jy of r1, say h C J; and 
1 
Ih) < 5!) 


such that |f(x)| > 2, for all x € h \ {rı}. Inductively, we can construct a 
decreasing sequence of closed intervals (J,,),>1 with 


Ibn) < ==) 
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such that for each n > 1, |f(x)| > n, for all x € I, \ {r,}. If c is the common 
point of all intervals J, n > 1, then we must have 


F| >n, 


for all n > 1, which is impossible. 


. For each ¢ > 0, there is a closed interval denoted /(¢) for which f(x) < e, for 


all x € I(e). Assuming the contrary, then there exists £o > 0 with the following 
property: for every interval Z C [0, 1], we can find x € J such that f(x) > £o. 
Then for every partition (xk)o<g<n Of [0, 1], we choose & € [x—-1, xx] so that 
f(Ek) = £0. Now we can find a sequence of partitions with norm converging to 
zero for which the Riemann sums 


oa (f, &) = DVOSED — xXk—1) = X eo — Xk—1) = £0. 
k=l k=l 


Therefore 


1 
[ Dazn, 
0 


which is a contradiction. Thus, the assertion we made is true. Now let I © 
[0, 1], Z1 = [a1, bı] be such that f(x) < 1, for all x € Jı. Hence 


by 1 
0< fŒ) as f f(x)dx = 0, 
a 0 


so 


bı 
f(x)dx = 0. 


a 


We can apply the assertion for the restriction f|;,. Indeed, there exists a closed 
interval D C J, so that f(x) < L, for all x € h. By induction, we can find 
a decreasing sequence (J,,),,>, Of closed intervals for which f(x) < 1, for all 


x € l. Finally, if c is a common point of all intervals J,, n > 1, then 


E <4, 
n 


forall n > 1, so f(c) = 0. 
Next, let [a, b] € [0, 1] and define the restriction 


flac) : la, b] > R 
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of the function f. Because f |ja,b] = 0, it follows that 


b 1 
o< f fa) des | f(x)dx = 0, 
a 0 


so 


J E dx = 0. 


According to the first part of the problem, there exists c € [a,b] so that 
f(c) = 0. In conclusion, f has a zero in every interval [a, b] € [0, 1]. 

6. We have to prove that Cy N Cg # Ø. Let xo € Cf N (a, b). From the continuity of 
f at xo, we can find a closed interval xọ € Jo C (0, 1), with 1p) > 0 for which 


x,y E€ b > If) —fO)| < 1. 


Indeed, Jọ can be taken with the property that 


x € fy > A-S] < 5. 


In this way, for any x, y € Jp, we have 


1 1 
IF) — FO)! S FE —FGo)| + FO) — Fol < 5 L 


Further, let x} € C N Io. Similarly, we can find a closed interval x; € 1) C 
Io, 1) > 0 for which 


xy Eh > |œ- g()| < 1. 


We also have 


Fa -fOI <1, Is@)—-sO) <1, 


for all x,y € Iı. By induction, we can define a decreasing sequence (J,)n>1 of 
closed intervals, with /(/,) —> 0 as n — oo, such that 


1 1 
lf) -fol < JT? Ig) — g0)| < an? 


for all x, y € I. If {c} = N In, then obviously c € Cf N Cy. 


n>1 
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We denote by r both the Riemann (or Thomae) function, given by r(x) = 0 if x 
is irrational, and r(x) = 1/q if x is rational, written x = p/q in its lowest terms 
(that is, p and q are relatively prime integers, and q > 0), and its restriction 
to the interval [0, 1]. The restriction is used in the first part and the function 
defined on the entire set of the reals—in the second. 

For (a) the answer is no. If such a function s exists, then the continuity set of 
s is dense in [0, 1], Cs = Q N [0, 1] . Consider here also the Riemann function 
r with the continuity set 


C, = (R\@ A [0,1]. 


Now we have the functions r,s : [0,1] — R with the continuity sets dense 
in [0, 1], 


C, = ©, = [0,1]. 


According to the previous problem, there exists a point z € C, N Cs. This is 
impossible, because the following relations are contradictory: 


zEC, & z€ (R\QN(, 1] 
and 
zZzEC SO zEQN(O, 1]. 


For (b) the answer is still no. Suppose, on the contrary, that f : R > R is 
continuous and has the property that f(x) is an irrational (respectively rational) 
number whenever x is rational (respectively irrational). Let r be Riemann’s 
function defined on the entire set of reals. Thus r is continuous (and takes 
value 0) exactly at the irrational points. Let g = r o f and note that g is also 
continuous at every rational point q (as q is transformed by f into an irrational 
point, at which r is continuous). On the other hand, let i be any irrational 
number, and assume that g is continuous at i. If (qn)n>1 is a Sequence of rational 
numbers with limit i, we must have 


a) = lim glg) & rfO) = lim fqn) = 0 


(because each f (qn) is irrational); hence f(i) has to be irrational—which is not 
true. Thus, the function g is not continuous at i—consequently, g is continuous 
precisely at the rational points. Now the existence of r and g (actually of their 
restrictions to [0, 1]) is contradictory—the contradiction being the same as in 
part (a) of the problem; therefore such a function f cannot exist. 
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8. (Claudiu Raicu) For (a), we could define 


0, x=0 
1 
f= tl-a eQ, 
q q 
1, xER\Q. 


where we write p/q for a fraction in its lowest terms, as defined in the statement 
of problem 7. Actually, one sees that this is 1 — r, for r defined in that 
problem, too. 

Similarly, for (b) it is enough to choose 


rea ee oe 
q 0,p=0° 
Let us now solve (c). Suppose, by way of contradiction, that such a function 
exists. Let 
hH#2h2Dh2...2h-1, 

such that f(,) C [k,oo) fork = 1,2,...,n — 1. Then there is some x, € 
QN,- with f(x,) > n and consider a segment J, C J,-; such that f (x) < f(x) 
for all x € JI. Such J, exists because x, is a point of local minimum. Then, 
Ff Un) C [n, co). We can continue inductively to construct a sequence 


Ihp2Dh Dhe2... 


such that f (I4) C [k, co) for all k. If 


oo 
XE N Ik, 
k=0 


then f(x) > k for all k, a contradiction. 
9. Suppose, by way of contradiction, that the conclusion is false. Then there is a 
sequence x, —> oo such that 


lim f(x) =1# 0. 


Suppose, without loss of generality, that / > 0 and let 0 < e < l. Then there is 
N E€ N such that f (xn) > € for all n > N. By continuity, it follows that for every 
n > N, there is a closed interval Z, = [,, By] > xn such that f(x) > € for all 
xE. 
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We will prove that there is x € R such that {nx : x € R}Nk # Ø for 
infinitely many k € N. Clearly we can assume, without loss of generality, that 
B n— An > 0. 

We will build inductively a sequence of closed intervals (Jj, ),,., as follows. 
Choose k; arbitrarily. Consider k, fixed. It is easily checked using „n — a, > 0 
and @,, Ên —> oo that for large enough k+, there is M, € N such that 
[Mcx,,,MnBr,] D [Akis Bk, 4]. SO, let us choose k,,4; with this property. By 
the construction, 


lær, Êr] D E fa | 5 | Qk Pr | : 


M: M; MM2’ MıMz 
lee) 
Ok, Bka 
= l ni y Boje 
n=1 1 i Ii i 


Then, for x from this set, we see that, in fact, {nx : x € R} O4 Æ Ø for infinitely 
many k, which contradicts the fact that lim f(nx) = O since f(nx) > e for 
n> 


infinitely many n € N. Hence, our supposition was false and the problem is 
solved. 

Note that this problem (called Crofts lemma) was also discussed in 
Chapter 1. The reader might be interested to compare two wordings of 
(basically) the (same) solution. 

It is clear that f (0) = 0 and f(x) < x for all x > 0. There are 2 possible cases. 

First Case. There is some 7 > 0 such that f(x) = 0 for all x € [0, n]. Then, 
for all x, there is some open interval V(x) 5 x and some n(x) such that 


F'O VW) C O, n). 
Let x1, x2, . . . , Xp such that 
V(x) U VQ) U ... U VŒ) D [0, 1]. 
Then it is clear that for 


n = 1 + max n(x), f"=0. 


l<i<p 
Second Case. We build inductively a sequence I, = [a,, b,] with the properties: 


f(x) > 0, Vx € I, and bn+1ı < max f(x) 
x€ln 


The construction is clear due to the hypothesis of the second case. Let us define 


Kn = {xe l fŒ) E Decal’ @) E I,} : 
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11. 


We obviously have K, # Ø and K, D K,+4,. Since K,, is compact, we have 
xE N Ky. 
n 


For such x, 


0 E tx FO). SF). LF FO), «5 


which is a contradiction, hence the second case is impossible. 

Let f : [a,b] > R be a continuous function, differentiable on (a, b). We intend 
to prove that there exists c € (a, b) such that R(a, b) = f’(c), where we define, 
for all x,y € [a, b], x Æ y, 


_ fe) -fO) 
x—y i 


R(x, y) 
We also let 


RG, x) = fœ) 


for every x € (a,b). To begin, we note the following properties of the 
function R. 

First, R is continuous at any point where it is defined, by the continuity of f 
and by the definition of derivative at points (x, x), x € (a, b). It follows that, if 
we fix some a@ € (a,b), the function x + R(«, x) is continuous at every point 
of [a, b]. 

We also have that, for all x < y < z in [a, b], R(x, z) is between R(x, y) and 
R(y, z). This is because 


R(x,z) = RG, y) + RG, 2 = (1 — AR, y) + tR, 2) 

z—x z=—x 
with t = (z — y)/ (z — x) € (0, 1). This is an almost trivial identity (isn’t it?) but 
it becomes critical in this proof of Lagrange’s theorem (due, as far as we know, 
to the Romanian mathematician Dimitrie Pompeiu). Note that if two of R(x, y), 
R(y,z), and R(x, z) are equal, then all three are equal. Of course, R(x, y) = 
R(y, x) holds for every distinct x, y € [a, b]. 

And now for the proof. If we have R(a,x) = R(x,b) for all x € (a,b), 
then R(a,x) = R(x,b) = R(a,b) and f(x) = R(a,b)(x — a) + f(a), hence 
f'(x) = R(a,b) for all x € (a,b), and there is nothing left to prove. Thus 
we can assume that there is such u € (a,b) with R(a,u) # R(u, b), say with 
R(u,a) = R(a,u) < R(u, b). Because the function x + R(u, x) is continuous 
and R(a,u) < R(a,b) < R(u,b), there exists v € (a,b) such that R(a,b) = 
R(u, v). If v = u, this means R(a,b) = R(u,u) = f’(u) and the proof ends 
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here. Otherwise, we have an interval J; = [a,, b1] (with {a,, b1} = {u, v}) such 
that a < a, < bı < band R(a, b) = R(a;, bı). 

Now let a, = (a, + bı)/2 and look at R(a),a) and R(a}, b1). If they 
are equal, then (as we noticed above), we also have R(a1, b1) = R(a;, a4) = 
R(a}, bı); in this case, we consider the interval J, = [a2, b2] to be one of the 
intervals [a;, a5], [a,, b1] (it doesn’t matter which). Observe that J is included 
in Jı, has length half the length of J}, and has the property that R(a,b) = 
R(aı, bi) = R(~, b2). If not, then R(a;, bı) is strictly between R(a;,a,) and 
R(a}, bı), and, since the function x > R(a‘,x) is continuous, there must be 
some b} € (a1, bı) such that R(a;,b;) = R(a}, b5). If a, = b}, the proof ends 
here, because R(a,b) = R(a;ı, b1) = R(aj,a,) = f'(a}). Otherwise, we just 
built an interval J, = [a2, b2] which is included in /,, has length at most half the 
length of 7, and which is such that R(a, b) = R(a,, b1) = R(a2, b2). Of course, 
a and b; are (in the second case) a’, and b’, in increasing order. 

Clearly, this process can be iterated in order to get a sequence of nested 
compact intervals J; D h D> --- such that each 7,41 = [an+1, bn+1] is included 
in J, = [an, bn], has length at most half the length of J,, and also, we have 
R(a,b) = R(aı, bi) = --- = R(q, bn) = ---. (If at some moment we get 
an = bn, the proof ends because this means R(a, b) = f’(a,). Otherwise the 
process continues indefinitely.) Moreover, the first interval is included in (a, b) 
(and, consequently, all of them are). This means that the common point c of all 
intervals Z, is in (a, b), too. As 1,41) < C1/2)/C,) for all n, the sequence of 
lengths of intervals goes to zero; therefore we have 

lim a, = lim b, = c 
n—->oo noo 


and, finally, 
R(a, b) = lim R(an, bn) = R(c,c) = f' (o). 
noo 


Note that this (rather complicated) proof shows that Lagrange’s theorem can be 
obtained with very little knowledge on derivatives (basically, only the definition 
of the derivative is needed). Thus we can obtain other theorems (such as Rolle’s 
theorem) using a method different from the usual one, and we can obtain the 
equivalence of the important theorems in the analysis on the real line. 


Chapter 9 
The Splitting Method and Double Sequences 


We use here the splitting method to establish some useful convergence results. The 
splitting method is a useful tool to compute the limits of certain sequences of real 
numbers, whose general form is a sum s, of n terms, which do not behave in the 
same way. In fact, this method consists of decomposing the sum s, into two sums, 
which are analyzed separately, using different methods in general, adapted to the 
behavior of the terms composing them. We give first some examples of problems 
which use the splitting method and then some general results, with lots of practical 
applications. 

The next problem is typical for this type of argument and was discussed in the 
Jury of the Romanian Mathematical Olympiad, 2002. 


Problem. Let f : [0,00) — [0, 1) be continuous, with lim f(x) = 0. Prove that 
Xx—> 0O 


1 2 n 
im |S roa f iwa f sonar] =o. 


(Here, f*(x) denotes (f(x))*.) 


Solution. Let M € (0, 1) satisfy f(x) < M, for all real numbers x € [0, 00). Let 
us see why such an M exists: there exists some n such that if x > n, then f(x) < L, 
On the other hand, f has a maximum M; < 1 on [0, n] because it is continuous. Thus 
M = max(M,,0.5) is a possible choice. Let us take € > 0, actually £ < 1/2, and 
consider a rank k for which f(x) < e€ for all x € [k, 00). We have the decomposition 


n 


1 2 
a n n—1 oe 
a= f rodat f f (dr tet | fdr 


n— 


1 2 k 
ee n n—1 a n+1—k 
= ( i f'(xdx + J Fods- / _f coax) 
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k+1 k+2 k 
+( PIOA / Pade + f fsa). 
k k+1 ae 


First, 


1 2 k 
n n—1 ind n+1—k 
/ f oar f Sf" (x)dx+--- + Js (x)dx 


M"+!=k 


aM AM kn p mt! < . 
~ 1—M 


Mtti-k 
= 0, so we can find a rank nı for which 


Further, lim 
noo 


1 2 k 
[ rox f Pod f fr dees, Yn>n. 
0 1 k-1 


On the other hand, 


k+1 k+2 n 
f(x)dx + fads -+ f f(x)dx 
k+1 n—-1 


E 
A a er ei 
=E 


so 


k+1 k+2 n 
f(x)dx + fo" Warten: f fœ)dx < 2e. 
k+1 n—1 


Finally, by adding the previous inequalities, we obtain zn < 3e, V n > ny, so the 
problem is solved. L 


The reader will immediately be convinced that the following problem is not an 
easy one. However, it becomes much more natural in the framework of the splitting 
method. 


Problem. Prove that 


0 ee ee ae e 
lim = 


noo n” e-1 


Solution. If we write the sequence starting with the last term, we obtain 


ee Gor a er eas 
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1\" 2\" n—2\" n—1)\" 
patie) Cee ee ee 
n n n n 


Now we can see that the sequence (a,),,, is closely related to the limit 


or 


e 


lim (1 +e! +e? +--+ e™”) = 


noo e— l ` 

More precisely, using first the inequality 

k n 

(: — =) < ek, 

n 

we deduce that 
1 
ee a piepe le =e 
—e 


e 
ora, < —, Yn> l. 


a= 
On the other hand, for all integers n > k > 1, we have 


1\” 2 n kN” 
az1i+(1--) eS =| aaa fy 
n n n 


If we consider now the inferior limit with respect to n, we obtain 


1\" 2\" k\" 
lim inf a, > lim inf [i+ (1-5) +(1-2) ++ (1-5) | 
n—>oo noo n n n 


=1te'+e?+e te, 
SO 


lim inf a, > 1 +e! +e? +--+ e™, 
n—>Co 


for all positive integers k. Further, by taking the limit as k — oo in the last inequality, 
we infer that 


lim inf a, > 
noo e— l 


Now the conclusion follows from the inequalities: 


ee e 
< lim inf a,, a, < —. O 
noo 


g= e-1 
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Here is another typical application of the splitting method: 


Problem. Let (an)„>;ı be a decreasing sequence of real numbers with 


lim a, = 1. 
noo 


Prove that the sequence with general term b, = (1 + 1) ( 1+ 2) tee (1 + =) 
converges to e. á 7 . 


Solution. Let us consider the term 


bopa (es ee pe Pie 
en aE k+m k+m k+m 
ak+1 Ak+m 
RE ie a ape A 
(+e) ( E) 


The product of the first k factors can be estimated as follows: 


1+ a 1+ a2 1+4 ak 
k+m ktm) k+m 


( a) j ka 
< {1+ — | < emm 
k+m 


Assume next that for a given £ > 0, kis chosen such that 1 < ak+m < 1 + e, for all 
positive integers m. Hence 


In conclusion, 


kaj 
Dbițm < erm Tlte < elte, 


for fixed (but arbitrary) € > 0 and sufficiently large m, because for big enough m, 


we have a < e. Since £ can be any positive number, this implies lim sup b, < e. 
n—>Co 


On the other hand, 


(42) < (142) (04+%)-(4%)=0, 


for all n > 1; this yields e < liminf,..5 bn and finishes the proof. O 
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Alternatively (but without using the method that we are discussing now), if one 
considers the sequence with general term cn = b,/(1 + (1/n))”, then one notes 
(like above) that 1 < c, for all n > 1. Yet, by the inequality 1 + x < e*, x € R that 
we already used, we see that 


ay— 1 a— 1 an— 1 d 
Ch = {it 1+ TA e < eh, 
n+ 1 n+1 n+ 1 


Ae Deck (aD 
F n+1 i 


where 


dn 


and lim d, = 0 follows from the hypothesis and the Cesaro-Stolz theorem. Thus 


noo 
L\2 1X? 
1f2) <b = (14) e 
n n 


we have the inequalities 
for all n > 1. Now the squeeze theorem finishes the proof. 0 

Finally observe that the condition about the monotonicity of (d,),>1 is not 
necessary; we can assume only that a, — 1 anda, > 1 for all n, and both proofs 
work. 

It is not difficult to see that by changing the order of the terms of a convergent 
series, usually we do not obtain the same sum (a famous theorem of Riemann asserts 


much more: if (a) is a sequence of real numbers and Xa, is convergent, but 
n>1 

>» jan| = oo, then for any œ € R, one can permute the terms of the series 2 an 

n=1 n>1 

so that the sum of the resulting series is œ). However, if the series is absolutely 

convergent, we have the following beautiful application of the splitting method: 


Problem. Let (a„)„ be a sequence of real numbers such that 5 |a,| converges. 
n>1 
Let o be a permutation of the set of positive integers. Prove that 


oo 
dn = J Ag(n)+ 
1 n=1 


(oe) 


n= 


Co 
Solution. Because y lan| < ©, > a, converges. Let / be its sum and let us 


n>1 n=1 
oo 


prove that 5 Ag(n) converges to l. Let € > 0 and choose N such that > lan| < €. 


n=1 n>N 
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Let M > N be such that {o(1),o(2),...,0(M)} contains {1,2,...,N} and let 
m > M +N. Finally define A = {1,2,...,m} \ {1,2,...,M — 1} and B = 
{o(1),...,0(m)} \ {1,2,...,N— 1}. Then 


m 
> Agi) — l| < 
i=l 


m 


X a-l + 
i=1 


m m 
Se Ao (i) — 5 di 
i=1 i=1 


Yia- ai < X lal +X lail + 


i€B i€A i€A i€B 


<2e+ gel r 
i=1 


Yai 


m 
i=1 


Co 
Because / = > ai, for sufficiently large m, we have 
i=1 


m 


Xal 


i=1 
lee) 


< 3e for sufficiently large m. This shows that 5 ag) = 1.0 
i=1 


< € and thus 


m 


5 đo(i) — l 


i=1 


The following two problems have shorter solutions, but are far from being 
obvious. 


Problem. Let (a,),>1 be a sequence of positive real numbers such that 5 an 
n>1 


j—l 
converges. Prove that > an ” also converges. 


n>1 


I- Bee te ; 
Solution. The idea is that if an s ven small, so is a, ", while if a, is not very 


1— 
small, say an > M, then an " < m Therefore we split the sum into two parts: 
one corresponding to those a, greater than or equal to 27”, and the second one 
; 1-1 F 
corresponding to a, < 2”. If a, > 27”, then a, " < 2an, so the first sum is 


1 1 
bounded by 2 5 an. If an < 2”, then ae "< JE so the second sum is bounded 
n>1 
1 at 
by 5 mT Therefore the partial sums of 5 al ” are bounded and so the series 
n=1 n>1 
converges. O 


Jnt+ ynt + "n+ Yn 


n 


Problem. Find lim 
noo 
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3 P n—1 n PEP 1 
Solution. If a, = SAEN E eN A then clearly a, > É : 
n n 
On the other hand, let M > 0 and split XVn ins = >. s/n and S) = 


i=2 2<i<M Inn 
5 </n. The first sum is at most MInn,/n = o(n), while the second one is 


n>i>Ml|nn 


L : ' 1 4 
at most n- e™”, because for i > M lnn, we have ¥/n = ni < nmmn = em. Thus 
Si + S2 p Minn 


n n 


E 
+ en, Now, if € > 0, there is M with em < 1+ = and for such 


Minn 
a fixed M, we have 


< : for large n, thus a, < 1 + € for large n. Therefore 
n 


lim a, = 1.0 
noo 


Let us give now some important theoretical results concerning double sequences 
and their convergence. We discuss this in the same chapter because, as the reader 
will immediately notice, all proofs are based on the splitting method. We will 
discuss Toeplitz’s theorem, which is in fact a useful convergence criterion for a 
class of sequences of real numbers. We will study some of its consequences and 
applications, with a particular emphasis on a very useful result, the Cesaro-Stolz 
theorem. 


Theorem (Toeplitz). Let A = (ank)n>x>1 be an infinite triangular matrix with 
nonnegative entries, 


a, 0 0 0 0 

a21 422 0 0 0 

a3 432 a33 0 0 
A= 31 432 433 

Anl Gn2 An3 ... Ann... 


Assume that: 
n 
a) The sum of the elements in each row is equal to 1, i.e., X an = 1, for all 


k=1 
positive integers n. 


b) Each sequence determined by each column is convergent to zero, i.e., 
lim aną = 0, for all positive integers k. 
n->Co 


Then, for any sequence (ty),>1 which has a limit (finite or infinite), 


n 
lim ) Ant, = lim ty. 
noo = noo 
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noo 


n 
Proof. Leta = lim f, and let?, = X antr, n>1. 
k=1 


First, we consider the case a € R. By changing (ta)n>1 to (tn — @)n>1, We can 
assume that a = 0 (note that we used here the first hypothesis of the matrix A). 
For any £ > 0, we can find a positive integer N = N(e) such that |t| < 5, for all 
integers n > N. If n > N, then 


th = (amti +++ + anty) + (an NtitN+1 +++ + antn). 
From b), we deduce that 
lim (anıtı Hees a Anntn) = 0, 
noo 


so there exists n, > N such that 


E 
lanity +--+ + annty| < 7 Yn >n. 


Finally, for all n > ns, we have 


n 


n 
E E E E 
lin] <5 + ` anlal S 5 + 5 Dank = 5 + 
k=N+1 k=1 


= E. 


NI 


To complete the proof, we assume now that t, — oo, as n — oo. Let M > 0 and 
N > 1 beso that t, > 3M, for all integers n > N. We can find N; > N with 


lanti +++: + dunty| < M, 


for all integers n > N; and let No > N; be such that anı + +++ + ann < i. for all 
integers n > M2. Now 


Ta = (amti +++ + dnytw) + (Gn.n+itwr +++ + Annt) 


n N 
>—-M+3M J` an =-M+3M(1 -Y an) 
k=N+41 k=1 


1 
> -m+3m(1->) =M, 


for all integers n > N> and consequently tn —> œ, as n —> œ. O 


A direct consequence that is very useful in practice is the not so well-known 
Cesàro-Stolz theorem: 
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Theorem (Cesaro-Stolz). Let (x;),>; and (Yn),>, be two sequences of real 
numbers. Assume that (y,,),,>1 is increasing, unbounded, and that there exists 


7 Xn — Xn-1 
a= lim ———. 
noo Yn — Yn—-1 


. Xn 
Then lim — =a. 
noo Yn 


Proof. We have the following identity 


n n 
Xn es Xk — Xk-1 
eg . = Anklk, 
Yn k=l Yn Yk — Yk-1 k=1 


where 


Yk — Vk-1 Xk — Xk-1 
ank = ——, kk = —W—, 
Yn Yk — Vk-1 


and xọ = yo = 0. For all n > 1, we have 
” T Yk — Yet 1 
J an = >> = —- On — 0) = 1 
tl te Yn Yn 


and for each k > 1, 


: . Vk — Yk-1 
lim ay, = lin —W—— = 0, 
noo noo Yn 
Xn — Xn-1 


oe 
> Q, SO f, = — > g. O 
Yn — Yn-1 Yn 


To see the strength of this criterion, consider the following B6 Putnam problem 
from 2006: 


because y, — œo, as n > co. Now t, = 


Problem. Let k be an integer greater than 1. Suppose ap > 0 and define 


1 


An 


antl = An + 


a+! 
for n > 0. Evaluate lim +~. 
n>oo ” 


Solution. It is clear that the sequence is increasing. We claim that it tends to oo. 
Indeed, otherwise it converges to a finite limit / and passing to the limit in the given 
relation, we obtain a contradiction. Now, in order to get rid of n* and to apply the 


Cesaro-Stolz theorem, it is better to take the kth root of the sequence whose limit 
I+1/k 


an 


we are asked to compute. Thus, we will use Cesaro-Stolz for the sequence “— 
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If we manage to compute the limit of alt" e alt" F then the previous sequence 


will have the same limit. However, this boils down to finding the limit as x tends to 


oo of 
1 1+1/k 
(=+ z) — x! tlk 


By changing the variable to x!/* = t, we need to compute the limit for t > 00 of 


1 1+1/k 
k+1 
t ((: + at) = ) : 


However, it is clear that this limit is 1 + 1/k, simply because another change of 


4 n: L 1+1/k_ 
variable (%7! = u) shows that it is the same as the limit of ea 


as u tends 


+1/ 
to 0. Thus the limit of a is 1 + 1/k, and now it is clear that the answer to the 
problem is (1 + 1/k)*. O 


Using the same splitting method, we will establish probably the most useful result 
of this chapter, Lebesgue’s dominated convergence theorem for sequences. It gives 
a very easy-to-verify condition for passing to the limit in an infinite sum. 


Lebesgue’s theorem. Let (amn)mn>ı De a double sequence of real numbers for 
which there exists a sequence (an),>, such that lim amn = an and |amı| < On, 
= m—>Co 
CO 


for all positive integers m,n and some sequence (dy),>1 . Then, if the series Xo, 


n=1 
converges, 


[o.@) CO 
lim > Ann = J an. 
m—> oo 

n=1 n=1 


CO CO 
Proof. First note that |a,| < œn, so the series X am and Xa, are absolutely 


n=1 n=1 

convergent. With the notations 

[0.6] [0.0] 

o = ) an, th = ) Amn, 

n= n= 

we have 
p CO 
O — tml < ) |an — Gmn| + ) lan — amn 
n=1 n=p+l 


[0.0] 
y Qn. 


n=p+1 


P 
< 5 jan = amnl + 2 
=1 
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[0.0] 
For each real ¢ > 0, let np be such that > An < E, Then for sufficiently large m, 


n=ngo 


no 


€ 
|o — tml < $` lan — am| + 2 < no 


n=1 


ie m 
t pi nE, 
2no 2 


We end this theoretical part with an application of the previous result: 


Problem. Let a > 1 be a real number. Prove that 


n k 
lim Ve~” ye en) = arn 
o E V 2r(& — 1) 
Solution. Let 


o oan _ (an)" n! 1 
TSR kon! LGB! eF 


k=0 i k=0 


1 1 
Let Ank = m- H! : (ane if k <n and 0 if k > n. Then 0 < ank < ae and 
we deduce that 


because ) —z converges and lim a, = at 
7 a noo a 


n 


im . = = 3 
noo (n—k)! (an) a ak a-l 


k=0 


Therefore 


n 
e 


It is enough to use Stirling’s formula n! ~ ( ) /21n to deduce that 


Ji Stank a 
Be ae e A 
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Proposed Problems 


1. Let (ank)nk>1 be a triangular matrix with the following properties: 


a) lim ang = 0, for each k > 1; 
noo 
b) there is K > 0 such that 


n 
Yo lanl <K, 
k=1 


for all positive integers n. If the sequence (t,)„>ı converges to zero, prove 
that the sequence Gist given by the formula 


n 
th = ) Anktk, 
k=1 


also converges to zero. 
2. Assume that the hypothesis of Toeplitz’s theorem hold. Prove that if (tn),,>1 is 
bounded, then Gist is bounded and 


sup [f,| < sup |t] . 
n>1 


n>1 
3. Let (n)y>0+ On)n>o be sequences converging to zero and such that the series 
Co 


PS Yn is absolutely convergent. Prove that 


n=0 


lim (Xo¥n + X1Yn-1 +s + Xn-11 + Xnyo) = 0. 
noo 


4. Let (Xn)ns0+ Wn)n>o be Sequences convergent to x, respectively y. Prove that 


$ X0OYn + X1Yn—-1 ee Xn—-1Y1 + Xnyo 
lim = 


noo n 
5. If x, —> xas n — ov, prove that 


OROSO 


lim = 


n>oo Qn 


6. Let (%n),> , be a sequence such that lim (xn+1ı — Ax,) = 0, for some |A| < 1. 
n—>Co 


Prove that lim x, = 0. 
noo 
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7. Let (%n),>1 be a sequence such that lim (10xn+2 + 7Xn+1 + Xn) = 18. Prove 
p= noo 


that lim x, = 1. 
noo 


8. Let (%n),> 1 be a bounded sequence such that 


3Xn+41 — Xn 

Xn+2 S a ` 
for all positive integers n. Prove that (x,),,>1 is convergent. 

9. Let (dn) n> and (bn)„>o be sequences of real numbers such that 


Dy = an — &an+1 
with @ a fixed real number, |a| < 1. Assume that (bn )„>o is convergent and 


lim aa” = 0. 
n—>Co 
Prove that (an)„>o is convergent. 

10. Let (amn)mn>1 be a double sequence of positive integers and assume that each 
positive integer appears at most ten times in the sequence (amn) Prove 
that there exist positive integers m, n such that dy, > mn. 

11. Let (an)„>ı be a sequence of positive real numbers such that the series 


mn>1° 


converges. Prove that 


00 2 
1 1 
Evo lE) 


m=1n=1 


then find the sum of the series 


2 D 3m(n.- 3m +m.: 3") 


m—1n=1 


(oe) 


12. Let a, € C be a sequence such that lim na, = 0 and lim ye anx” = 0. Prove 
noo 


x 
n=0 
lee) 
that Se an = 0. 
n=0 
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13. Let f : [0,1] > R be a Lipschitz function and let the series X`% o a, be 
convergent with sum a. Prove that 


jim Dat (7) = af (0). 
k=0 


14. Let f : [0,1] > R be a continuous function and let the series }°7°9 ax be 
absolutely convergent (hence convergent) with sum a. Prove that 


jim, Da (7) = af (0). 
k=0 


Solutions 


1. For € > 0, we consider an integer n(e) for which |t| < 
n > n(e). For every n > n(e), we have 


zg for all integers 


n(é) n 
lin < Se anktk| + D [anktk| 
k=1 k=n(e)+1 
n(e) n x n(£) z 
= So ant + ya lank] |> aK = So ant + 7 
k=1 k=n(e)+1 k=1 
Let k(£) be such that 
E 
lnl 


(lal H lo D 


for all integers n > k(e) and all m = 1,2,...,n(e). 
Hence for every n > n(e) + k(e), we have 


n(£) n(e) g 
Dont < 2 lam: It] < Ie re =o (dal +++ lno) = 7 


Finally, [f,| < £, for all integers n > n(£) + k(e). 
2. The proof is identical to the previous one. 
3. We can use problem 1. Let us put ank = Yn— for n,k > 1, n > k. We have 


lim ang = lim y,-~ = 0, 
noo noo 
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for each positive integer k and 


n n 
5 lank| = 5 IYn—k| < K, 
k=1 k=1 


Co 
for some K > 0, because of the absolute convergence of the series > ye. Now 
k=0 
let t, = X,—-1 for all n > 1, hence (¢,),>1 has limit 0. By the result of problem 1, 
the sequence with general term 


n n 
th = > Anktk = > Yn—kXk—1 
k=1 k=1 


also converges to zero—which is the desired conclusion. 
We can also give a direct proof. Let M > 0 be such that |x,| < M and 
|yn| < M for all positive integers n. For € > 0, let n(e) be such that 


E 
|Yn+1] + [Yn+2| + ++ + [Yn+p] < zM 
for all integers n > n(e), p = 1, and |x,| < >¢ for all integers n > n(e), where 


Co 
K= 5 |yn| . Now, for all n > 2n(e), we have 


n=1 
xX1Yn—1 + X2Vn—2 + ++ + Xp—2Y2 + Xn-191| 
< (Ixil [Yn] + [eal [Yn-2] + + aoil [ynce+1]) 


F (|an) ` lyno] apes E [xn] ; yıl) 


4. We can assume x = 0, by taking (xn — x)n>0 instead of (xn)n>0. Further, we can 
assume that y = 0 by changing (¥n)n>0 to (Yn — y)n>0. If M > 0 is such that 
lyn| < M, for all nonnegative integers n, then 


X0Yn + +++ + XnYo as [xo] + lyn] + +++ + [xnl + lyol < y. ELE + bal 
n = n “+ n i 


By the Cesàro-Stolz theorem, 


[xol +: + eal 
lim ~ ————_. = 


noo n 


0, 


so the problem is solved. 
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Observe also that if the two sequences converge to 0 (we saw that we can 
always make this assumption), then the result is clear from Cauchy-Schwarz’s 
inequality combined with the Cesaro-Stolz theorem: 


X0Yn + X1Yn-1 seek XnYo 
n+1 


[atatt Eee 
> n+1 n+l ' 


5. We can use Toeplitz’s theorem with 


li sieo ANa 
-aoo 
Finally, 


n n 1 i 
0 k=0 


We can also give a proof which uses the splitting method. We assume that 
(X%n)n>0 converges to zero. For arbitrary fixed € > 0, let n(e) be an integer for 


which |x,| < 5, for all integers n > n(e). Let us put M = sup {|x,| | n € N}. 


Then for all n > n(e), we have 


n n n 
(erhi) 


2n 


(J (cy 


2n 


4 lnoi] Hee Xn 
tf n(e) + 1 n 


Qn 


k= 


= 


Solutions 


First, 


n n 
(Jesti jia 


Qn 


JEA as 


<M. <M. = 
Qn 2M 2 


> 


for sufficiently large n, say n > n,(e). On the other hand, 


t [xne 1| Hrt ý RA 
n(e)+ 1 n 


Qn 
oa) FG) 
E n(e)+ 1 n £E 
= yn 
Finally, 
n Xo +: + R x 
0 0 n 
< 
2 fe 


for all integers n > max {n(e), nı (£€)}, which solves the problem. 


165 


It is interesting to note that in the case when the sequence converges to 0, a 
simple argument based on Cauchy-Schwarz’s inequality and the Cesaro-Stolz 
theorem does not work. Indeed, it boils down to the fact that the sequence whose 
general term is 4 - E9) is bounded, which is false (the reader can prove this, as 


4n 
an exercise). 
6. We know that the sequence 


Yn = Xnt1 —ÀAXn, NÈ 1 


converges to zero. From the equalities 


Yn Xn+1 Xn 


Anti E Agri An 
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we deduce that 


Now it is sufficient to prove that the sequence 
n 
= Aye naz] 
k=1 


is convergent to zero. This follows from a corollary of Toeplitz’s theorem, more 
precisely, from problem 1. Indeed, with a„ą = A”~*, we have 


lim a, = lim A” * = 0, 
noo noo 


for all positive integers k and 


1 


n a n ma 
DO = DOI = 


7. First we will prove a more general result. Let a,b,c be real numbers, a # 0 
such that the quadratic equation ax? + bx + c = 0 has two real solutions in 
(—1, 1). Then each sequence (x,)n>1 with the property 


lim (axn+2 + bXn+1 + CXn) = 0 
noo 


is convergent to zero. Indeed, if we denote by 4, u € (—1, 1) the solutions of 
the quadratic equation, then according to Viéte’s formulas, 


b c 
A+ m= -—-, àu =. 
a a 
Now, 
lim (aXn+2 + bXn+1 + CXn) =0 
noo 
becomes 
¥ b c 
lim Xn+2 F —Xn+1 + -Xn | = 0 
noo a a 
or 


im, [ent — (A + W)Xn41 + AUX] = 0 
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which can be also written as 
lim [(X%n+2 = AXn+1) — M(Xn+1 — ÀxXn)] =0. 
noo 
In this way, with the notation u, = Xn+1 —Ax,,n > 1, we have 
lim (un+1 — Un) = 0 
noo 
and |u| < 1. We have seen that this implies lim u, = 0, that is, 
noo 
lim (Xn+1 AXn) = 0. 
noo 


Using again the result of the previous problem, we conclude that (x„)n>1 
converges to zero. In our case, note that the quadratic equation 


10x° + 7x+1=0 
has two real solutions in (—1, 1). Let us denote a, = x, — 1, n > 1. We have 
lim (10an+2 + 7an+1 + an) = lim Oant — 1) + 7@n41-D + On - DY] 


= lim [(10an+2 + 7an+1 + an) — 18] = 0. 
noo 


This implies a, — 0 and then x, —> 1, as n —> oo. 
8. Let us define the sequence 


1 
Yn = Anti = ăn N zl 


Clearly, if (xn)n>ı is bounded, so is the sequence (yn)„>ı (just note that |y,| < 


IXntil + Fal), The given inequality allows us to prove the monotony of the 
sequence (y,)„>ı - Indeed, 


1 1 
Yn+1 — Yn = (nt = sin) = (mi = 5) 


3 1 
= Xn+2 — z+ T yn <0, 
so the sequence (y,),,., is decreasing. Hence (y,),,., is convergent and let / 
be its limit. Searching for a real number a@ so that the sequence z, = x, — a, 
n > | satisfies lim (Zn+1 — iza) = 0, we immediately obtain a = 2/. Thus 
n—>Co 


the sequence z, = x, — 21, n > 1 satisfies 


168 


10. 
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. 1 
im (<n = sn) = 0. 


and, as we know, this is possible only if zn — 0, so x, — 2l. 


. By induction, for n > 1, we have 


n-1 1 n—k 1 n—-1 
a= 5 by (=) maar >: b,ot*, 


k=-1 
where b_; = —aoa@. Since a,” tends to zero, the partial sums on the right-hand 
side in 
aal 
—a,a" = > ba“ 
k=-1 


also approach zero. They may therefore be replaced by the negatives of their 
corresponding remainders 


[0,6] lo.) 
—a,a" = — J bat or a, = ) banya“. 
k=n k=0 


If b is the limit of the sequence (b,,),>—1, then we can estimate the difference 


An — 


1 
< —— - sup |x b|, 
k>n 


b 
l-a T 1-|a| 


Cc 
S Ore — dat" 
k=0 


which becomes arbitrarily small for large values of n. Hence (an)„en converges 
tob/(1-@). 

Let us assume, by way of contradiction, that a,,, < mn, for all positive integers 
m,n. For every positive integer k, the set 


Ar = {i j) | ay < k} 


has at most 10k elements. On the other hand, A% contains all pairs (i,j) with 


ij < k and there are 
f + R ot 
1 2 kj 


such pairs. Thus, for all positive integers k, 


eH 
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This inequality cannot be true for large k, because 


ae EA 


k—> o0 k 


11. By interchanging m and n, we have 


= 2 2 Am (An F an) 7 2 È 1 Anam T An) 


m=1n=1 m=1n=1 


Thus 


is 2 >D (— + an) 2 An(Am + =) 


-5Sa (Sa) E)E) 


For the second part, let us take a, = = to obtain 


co œ 
Da peeern, 
m=1n=1 


if - 35 
n 
a - Yan me 3") -3(3:4) aN 


a 
m( B m=1n=1 n=1 


Indeed, if we differentiate 


1l-x 5 27 
n>0 
for |x| < 1, we obtain Tr E ee thus SS — =- 


n>1 
o0 


12. Let f(x) = > a,x", which is defined for |x| < 1 because a, is bounded. Then 


n=0 
forallO <x <1, 


N 


Xa, 


n=0 


N 
Xa + Soan — 1) 


n>N n=0 


< lf@)| + 


N 
< + Yo la" + YO lanl = x") 


n>N n=0 
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< If) + wy pha Yo $ (=) lala e. 


n>N n=0 


< [fo] + — sp ay 0-2) ala 


n=0 
N 


1 
<F@Ol+ 5 Naw we \na,| + N(1 =) Y nlan 
n=0 


1 
Choose x = 1 — —, then 
N 


N 
1 1 
< r(1 — x) SUD idal + WL "lanl. 


n=0 


N 
2 an 
n=0 


By making N — œ in the previous inequality, we obtain 


N 
lim J dn = 0; 
N->oo 
n=0 
because 


1 
lim (1 — x) =0, lim sup |na,| = 0 


Noo N>% n>N 
(because lim n|an| = 0) and 
n—>Co 


N 


ye vil 
py otal, 


n=0 


by the Cesaro-Stolz theorem. 

13. Solution I. The fact that f is Lipschitz means that there exists a constant L > 0 
such that |f(~)—f(y)| < L|x—y| for all x, y € [0, 1]; consequently, |f((k—1)/n)— 
f(k/n)| < L/n for all positive integers n and k < n. We have 


n 


A EE e OE) 


k=0 k=1 


+a +41 +: +a, —a)f (=); 
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therefore, using the triangle inequality and the Lipschitz condition, we obtain 


Yar (*) -aro 


k=0 


1 n 
SL- -9 la+ a + + am al 
= 


+ lag + ay +- +a, —al|fC)|. 


Now, the hypothesis tells us that limy,—+o9 |ag + a) +--+ + an — a| = 0, and the 
Cesaro-Stolz theorem ensures that 


1 n 

lim -X Jao + a1 +---+a,_, —a| = 0, 

n> n 

k=1 

as well. Thus, the sequence on the right hand side of the above inequality has 
limit 0, which proves that the left hand side also has limit 0, finishing the proof. 


Solution II. We need the following simple (and interesting by itself) lemma: 
if (Xn)n>0 is a convergent sequence of real numbers, then 


lim [Xn —Xn—1]) + + [Xn — xol =0 
noo n cee 


This can be proven by using the splitting method; we proceed further. 

Because (x;)n>0 is convergent, it is also bounded, hence there exists M > 0 
such that |x,| < M for all n; it follows that |x, — xn| < 2M for all m and n. 
Further, since (x,)n>0 is convergent, it is also a Cauchy sequence; consequently, 
for a given € > 0, there exists a positive integer N such that |x, —x,| < ¢/2 for 
all m,n > N. We then have, for n > max{N, 4NM/e}, 


[Xn —Xn—1| +++ + [Xn xo] Xn — Xna) + +++ + [Mn — NI 


n n 


= ani F + [Xn — —N)e 2NM e e 
4 Mn Xn-1| [Xn — Xo| g (n ) + aap 
n 2n n 2 


and the lemma is established. 
Now for the problem, we observe that 


n 


Vaf (=) SS af(0) 
k=0 


k=0 
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by an Abel summation formula again. The triangle inequality and the Lipschitz 
condition yield 


Vaf (=) -Y ag) 


k=0 k=0 


n—-1 


SL o lan +1 H aail 


n=] 


=L. Do n=k=1l; 


k=0 


CO 
where $, = do + ai +++: + Gp is the pth partial sum of the series 5 an. AS 
n=0 
(S;)n>0 is a convergent sequence, the lemma applies and shows that the right 
hand side from the previous inequality has limit 0; thus, the left hand side also 
tends to 0; that is, 


lim (>: arf (=) = Yaro) =0. 
k=0 


k=0 


Now lim > af (0) = af (0) completes the proof. 
noo k=0 


CO 


14. Solution I. Let A = 5 la,|; of course, |a| < A, and if A = 0, there is nothing 


to prove, as a = 0 and all an = 0 follow from such an assumption; thus, we 
may assume that A > 0. 

Let € > 0 be given. It is not hard to prove (e.g., by using the Stone- 
Weierstrass theorem—see the Chapter 7) that the set of Lipschitz functions is 
dense in the set of continuous functions (all defined on [0, 1], say). Thus, for our 
continuous function f (and the considered ¢) there exists a Lipschitz function 
g: [0, 1] — R such that |f (x) — g(x)| < £/ (3A) for all x € [0, 1]. Yet, according 
to the previous problem, for the function g we have 


k 
lim $ akg (<) = ag(0); 
noo = n 


consequently, there exists a positive integer N such that for all positive integers 
n > N, we have 


E 
L a 
3 


7 k 
So ag (=) — ag(0) 
k=0 
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Now, by the triangle inequality and all of the above remarks, we have 
n k k 
Ew) -C 
a n n 
Z k 
Das (5) -20 
k=0 
i k 
wer (£) — ag(0) 
r n 


k= 


n 


Yar (*) -ro 


k=0 


F + lallf) — g(0)| 


E £ E 
< Ss a 
at 2 |ar| + Foala 


È E E 
gmoney nage nae 
aA 3 A 


for every positive integer n > N, and the proof is complete. 


Solution II. The function f is continuous and therefore bounded on the 
compact interval [0, 1]. Thus there exists M > O such that |f(x)| < M for 
all x € [0,1], implying that |f(x) — f(y)| < 2M for all x,y € [0, 1]. Because 
the sequence of partial sums of the series $- ¿<o |ax| is convergent, it is also a 
Cauchy sequence, hence, for any positive £, there exists a positive integer N 
such that 


E 
m m po nl < >> 
lan] + lampil +++: + lal < 
for all positive integers m and n with N < m < n. Yet, because f is continuous 
on the compact interval [0, 1], it is also uniformly continuous; thus there exists 
ô > 0 such that |f(~) — f@)| < ¢/(2A) whenever x, y € [0, 1] and |x — y| < ô. 
And here comes the splitting: namely, for n > max{N, N/5}, we have 


Yo af (=) -X af O| < do lad lf (=) -f0) 
k=0 k=0 k=0 
al k k 
=} la F(*) -100 + D7 lal ($) -s0 
k=0 k=N+1 
E€ = z g E 
< xq 2 la] + 2M X lal < te S 
k=0 k=N+1 


and we are done. 


(Note that for 0 < k < N and n > N/6, we have |k/n — 0| = k/n < N/n < ô.) 

This is problem 180, from Gazeta Matematică—seria A, 3/2004, solved in 
the same magazine, 3/2005. The problem, as well as the previous one, was 
proposed by Dan Ştefan Marinescu and Viorel Cornea. 


Chapter 10 
The Number e 


The basic symbol of mathematical analysis is the well-known number e. It is 
introduced as the limit of the sequence 


1 n 
e=(1+3) ,n>l. 
n 


We have the following result: 


Theorem. The sequence (€,)n>1 is monotonically increasing and bounded. 
Moreover, the sequence (f,)n>1 defined by 


1 n+1 
f= (1+-) nèl. 
n 


converges to e and is decreasing. 


Proof. Indeed, in order to prove the boundedness, one can easily prove by 
induction with respect to k the inequality 


Then, by taking k = n, we obtain e, < 3. On the other hand, Bernoulli’s inequality 
gives 
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SO €n > 2. We proved that e, € [2,3] for all positive integers n. We use again 
Bernoulli’s inequality to establish that (€,),>1 is monotone; we have: 


enti _ 1 1 ” n+2 
en (n+ 1)? n+1 
w + 3n? +3n+2 
> 
n+ 3n2?+3n+1 


SO (€n)n>1 is (strictly) increasing. Clearly, lim fı = e. Finally, with Bernoulli’s 
n= 


inequality, 
5 1 n+1 1 
A =à) gees: 
Snt1 n(n + 2) n+2 

n+1 n+1 

>{1+ . 

n(n+2)/ n+2 

m+4n?+4n+1 
>1 

n> + 4n? + 4n 


SO (fa)n>1 is decreasing. This finishes the proof of the theorem. C 


More generally, we can consider the function e, : R —> R, given by 
enx) = (1+=) ,xeER. 
n 


As above, the sequence of real numbers (e,,(x))n>—x is increasing, and for all real 
numbers x, 


tim (1+ zy = 


n—> 0 
The following result is particularly important, so we present it as a theorem: 
Theorem. The sequence 


re el 1 
Xn = taita ty 


converges to e. Moreover, we have the inequality 


0<e-x, < 


n-n! 


and e is irrational. 
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Proof. We have, by the binomial formula, 


ar a EE 


ante 1 
< titat t g 
On the other hand, we clearly have 
<1+ ae 1412 23 
Xn = =. eee = . 
2 4 as 1-5 


Thus, (x;)n>1 is strictly increasing and bounded from above. From e, < Xn, we 
deduce that lim x, > e. On the other hand, for all p and all n > p, 
noo 


1 1 1 1 1 p-1 
a eae la a ae ec mee pier MC ce ae ; 
1! 2! n p! n n 


and if we take the limit as n —> oo, we obtain 


See ine 
e if 2 p! 


for all positive integers p. For p > œ, e > lim x,. In conclusion, 
prow 


ji 1 1 1 1 
hm Te T + zi Fa H =e. 
Now, for any positive integers n and m > 2, 


1 1 1 
GDh Ga ae 


1 1 1 
“mr arar ariar 


1 
in ts E (ios heed: 1 is 1 l-u 
(n+ 1)! n+2 (n+ 2)! (n+ 1)! - = 
Consequently, 
1 
In ke ar 


Xm+n ~ Xn S 7 : 
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For m — ox, it follows that 


1 n+2 
0<e-x, < ——— -—_., 
(n+1)! n+1 
for all positive integers n. Thus 


1 


ni-n 


0<e-XxX, < 


To prove that e is irrational, let us assume, by way of contradiction, that e = Dz with 
q 


positive integers p, q. Then 


EARE E 
q 1! 2! l! qabaq 


By multiplying with q! - q, we conclude that 


EE 1 1 1 1 9 
qi: p=4q!-4 uT Lae + O%, 


or 


6 =dp-gea(lt a4 toto), 
4 iD q! 


This is a contradiction, because 0} € (0, 1) and the right-hand side of the last 
equality is an integer. In conclusion, e is irrational and the proof is done. O 


Problem. a) Prove that e* > x + 1 for all real numbers x. 
b) Let a > 0 be such that a* > x + 1, for all real numbers x. Prove that a = e. 


Solution. a) Let us define the function f : R > R, given by 
f(x) =e-—x-1. 


Note that f is differentiable, with f'(x) = e*— 1, sof’ is negative on (—oo, 0) and 
positive on (0, co). Therefore, f decreases on (—oo, 0) and increases on (0, co). 
It follows that 0 is a point of minimum for f, which finishes the proof. 

Another solution is based on Bernoulli’s inequality: 


XN” Xx 
(1+=) =14+=-n=14x. 
n n 


Now, by taking the limit as n — oo in the previous inequality, we obtain e* > 
x+1. 
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b) Let us define the function f : R —> R, by f(x) = a” —x—1. The given inequality 
can be written as f(x) > f(0), for all x. Hence x = 0 is a point of minimum for 
f, and according to Fermat’s theorem, f’(0) = 0. But this is equivalent to a = e. 
Note that we can use only sequences in order to solve this problem. Indeed, if we 


1, RON ; : 
take x = — in the given inequality, we obtain 
n 


1/n 1 1\" 
a’/>--+4+1>a>{14+-], 
n n 


1 
and for n —> oo, we deduce that a > e. Similarly, by taking x = “api 
n 
n+l 
evens, | 4g< (=) 
n+ 1 n 
or 
1 n+1 
a< (1 + *) 
n 
For n — œ, it follows that a < e. O 
If we replace in (10.1) x by x — 1, we obtain the inequality 
e!l >x, (10.1) 


for all real numbers x. This inequality can be used to deduce the AM-GM inequality 


a + a2 +: tan 
n 


> 4/412... An; (10.2) 


for all positive real numbers a1, a2, .. . , an. The inequality (10.2) is equivalent to 


a + a2 +++: +a, = Nn, 


for all positive real numbers a1, d2,...,@,, With aja2-+:ad, = 1. Assuming this, 
note that 

ef! 1 > ay 

ef 1 > a 

än— 1 

e” = ay 


and by multiplication, 


e” tagt+e+ay—n >1 
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or 
a tagt--+a,-—n=>OSataate--+a, =n. 


This proof is due to G. Pólya and it is still considered to be the most beautiful proof 


of this famous inequality. 
In the framework of this new result, we can give another proof for the mono- 


tonicity of the sequence (e,),>1. Indeed, 


1 1 1 
1 = = 
ae (=) aa =e ie 
a Mmaa 
n—1 times 


i 1 1 1 4 

>n . eee . 

n—-1 n-—l1 n—l n 
1 


“N = i‘) z \ (1 Ti T 


which implies e, > e€,-1. 
Further, we use in a different way the AM-GM inequality to obtain other 


interesting results. Indeed, 


ma) (ie 1 yr Mad er DON ae) A, 
n 


m+n+1 


1\” 1\"t! 
eG 
n m 


for all positive integers m,n. With the given notations, this means e, < fm for all 
positive integers m, n. In particular, 


n(n+2) 1 n+2 
n(n+2) < fayi => | 1 + ——— <{I+ 
N ( <a) ( rat 


n+)” m+ 
mat (n+ y 


= (n+ "(n+ ih a < n'(n+ Jyt! 


(= J ei 
> < => en < Cn41. 
n n+1 


This is (yet) another proof of the monotonicity of the sequence (€;)n>1, 
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1. Let F be the family of all functions f : R — R satisfying the relation 


f(x +y) = ff), 


for all real numbers x. 


a) Find all continuous functions from F; 
b) Find all monotone functions from F. 


2. Find all functions f : R > R which satisfy the following two conditions: 


a) fœ + y) = ff), for all real numbers x, y; 
b) f(x) = x + 1, for all real numbers x. 


3. Find all functions f : (0,00) —> R which satisfy the following two conditions: 


a) f(xy) < f(x) +f), for all positive real numbers x, y; 
b) f(x) < x— 1, for all positive real numbers x. 


4. For all positive integers n, prove that: 


e 1\” e 
m wag <e- (+5) opel 
b = E -e<£ 
2n+1 n 
5. Prove that 


1 n i 1 n+l e 
lim nj}e—{1+- = lim n|({1+- —e| =. 
noo n n> n 2 


6. Find the limit of the sequence (x,),>1 given by the implicit relation 


1 n+Xn 
(: + *) =e,n>1. 
n 


7. Find the limit of the sequence (x,,),>1 given by the implicit relation 


N+Xn 1 1 
1+- =l1l+= ++ ndal. 
n 1! n! 


8. a) Prove that e* > ex for all real numbers x. 
b) Prove that if a > 0 has the property a‘ > ax for all real numbers x, then 
a=e. 


182 


9. 
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a) Prove that e* > x° for all positive numbers x. 
b) Prove that if a > 0 has the property a* > x“ for all positive numbers x, then 


a=e. 
1 


10. Let a > 0 be such that x* > af for all positive numbers x. Prove that a = e™`. 
11. Let x < y be two positive real numbers such that x‘ = y”. Prove that there exists 
a positive real r such that 
1 r+1 1 r 
x={1l- , y=t{l- . 
r+1 r+1 
12. Let x < y be two positive real numbers such that x” = y*. Prove that there exists 
a positive real r such that 
1 r r+1 
(oJ ole) 
r 
13. Prove that: 
. Yn! 1 
a) lim — = =; 
n>œ n e 1 
K n+1 Va Sal) a 
b) lim ( Yn D! nl) a 
Solutions 
1. We have 


fa) =? (5) 20. 


If f(~) = 0 for some real œ, then 
F(x) = ff Œ- a) = 0, 


for all real x. Otherwise, we can define the function g : (0,00) —> R, by g(x) = 
Inf (x). We have 


gx +y) = Inf(x + y) = n FEO) 
= Inf(x) + Inf(y) = g@) + 80), 
so g(x + y) = g(x) + g(y). If f is continuous or monotone, then g has the same 


properties. Hence g(x) = ax, for some real a, and, consequently, f(x) = e“ in 
both cases f monotone and f continuous. 
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2. From f(0) > f*(0) and f(0) > 1, it follows that f(0) = 1. By induction and 
using the observation that f > 0 (because f(2x) > (f(x))? = 0), we deduce that 


Fr Hx Fe + Xn) = FOF a) -- fOr). 


x 
For x, = X2 = ++- =x, = —, we have 
n 


so 
x\n 
fa) = (1+7). 
For n — oo, we deduce f(x) > e*. On the other hand, 
1=f(0) = ff Cx) = eve * = 1, 


so f(x) = e". 
3. Clearly, f(1) = 0. By induction, 


fax.. Xn) <f) HFE) +++ +f On)- 


For x) = X2 = +++ =X, = %/x, we have 


FO) < nf (Wx) <n(V/e-1) 


For n — oo, we deduce that f(x) < In x. On the other hand, 
1 
x 


0 =F) <s0) +7 ( ) simx+ ine =0, 


so f(x) = Inx. 
4. a) The inequality is equivalent to 


or 


1 1 
1+Inn—-In (n+ ;) <nin(n+1)—nInn < 1+1n (n+ 5) -Intn 1). 
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This double inequality is true for n = 1,2, 3. For the left inequality, let us 
consider the function f : [4, 00) —> R given by 


f(x) = xln(x + 1) — (x + Dinx+ In (x+ 5): 


We have to prove that f(x) > 1, for all real numbers x > 1. We have 


x x+1 2 
"(x) = —— +1 1) - —— -1 — 
Fœ aq rk ) x ae ae 
with 
5x7 + 5x +1 


o= 


> 
x?(x + 1} (2x + 1)? 
It follows that f’ is increasing. Thus for x > 4, 
5 41 
’ "(4) =In- -—_ > 0. 
fa) >f 4 =z -— > 
Then f is increasing, so for x > 4, 


5t 9 


The right inequality can be proved in a similar way. 
Let us multiply the inequality 


1 
by 1 + —, to obtain 
n 


n+1 nt+2 n 
or 
n+2 yt". 2n+1 
= < é. 
2n+1 n 2n 
Thus 


Solutions 


or 


g 1 n+1 ê 
<{1l+- — e< —. 
2n+ 1 n 2n 


5. By multiplying the inequalities from the previous problem by n, we obtain 


ne 1\” ne 
—— <nle-(1+- < ——_ 
2n +2 | ( =) | 2n+ 1 


and 


The conclusion follows. 


6. We have 
1 n+Xn 1 
l+- =e>(n+x,)nf{l+-])=1 
n n 
=>n+ f 
n+ Xn = ——,, 
In (1 + 1) 
so 
1 
Xn = 
In (1 + 1) 
Using l’ H6pital’s rule, we deduce that 
, _ l-nh(1+}) 1-4 +») 
lim x, = lim Se K = lim — >= 

noo n>œ Jn (1 + 1) x>0 = In(1 +x) 

x-Indi+x) 1 

= m = x 

x—>0 x2 2 

7. We clearly have 
1 nO +++) 
Mee iy e ae 
In(1 + =) In(1 + =) 


Using the previous problem, it suffices to show that 


186 


10. 


11. 
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converges to 0. This is clear, because we know that 


1 1 1 
O>Infl+—+4+---+—]-1>In{1—- : 
1! n! en-n! 


. a) This is clear, because we have seen that e*! > x for all x, thus e* > ex. 


b) The function g(x) = a‘ — ax has minimum at x = 1; thus, according to 
Fermat’s theorem, g’(1) = 0. This implies a = e. 


. a) Let us consider the function 


fa) = =, x € (0,00). 


We have 


so f is increasing on (0, e] and decreasing on [e, oo). Therefore, 


1 1 1 
fa) <f@=>3— <5 


x 
> elnx < x > lnx < lne >x < e. 
b) If a* > x°, then 


Pi A lnx Ina 
lna* > Inx* © xlna > alnx & — < — 
x 


= ’ 


a 


hence a = e. 
If we denote 


f(x) = xInx, x € (0, 00), 
then a is an absolute minimum point of f. We have 
fa@=0614+na=0Sa=e. 


Indeed, x = e7! is an absolute minimum point because f is decreasing on 


(0, e~'] and increasing on [e~!, 00). 


1 
Let r be positive so that y = (: + *) x. Then 
r 


szy erz (f) 


Solutions 


Finally, 


1+ : 1 ay 
— -1x= = . 
4 r r+1 
12. By taking the (1/xy)-th power, we obtain 
i 1 1 1 
œ)» = 0» Sx = y’. 


Thus 


From the previous problem, there exists a positive real number r such that 


Les 1 1 a r i 
x r+1) \r+i1 


and 


Hence 
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13. a) As a consequence of the Cesaro-Stolz theorem, we have the root formula, 


: . Ant 
lim 2/a, = lim Z=, 


noo n>œ© An 
under the hypothesis that the limit on the right-hand side exists. Here, 


(n+1)! 
(m+ 
n! 


mi 


b) We have 


ati ( 
jim n (“Want DI- Yat) = = lim p m “Ve = ) 


n+ mF! 
val a A a NEA 


= Bud n n In n+l/ nF! “Tal 
Val 
pe ED] te sae "+Y [(n + 1)!" 
= —- lim + lim In —— 
e noo Yn e n>% n! 


1 1)! 1 i 1)” 1 
Se lim ini (a+ Dir _ 2e litt In ayat 1 
e n>% (n+! e n>% n! e 


Chapter 11 
The Intermediate Value Theorem 


Let J C R be an interval. We say that a function f : J —> R has the intermediate 
value property (or IVP, for short) if it takes all intermediate values between any two 
of its values. More precisely, for every a, b € J and for any À between f(a) and f(b), 
we can find c between a and b such that f(c) = A. A direct consequence of this 
definition is that f has IVP if and only if it transforms any interval into an interval. 
Equivalently, a function with IVP which takes values of opposite signs must vanish 
at some point. 


Theorem (Intermediate value theorem). The class of continuous functions is 
strictly included in the class of functions with IVP. 


Proof. Let f : [a,b] — R be a continuous function. If f(a)f(b) < 0, we will 
prove that there exists c € (a,b) such that f(c) = 0. We assume, without loss of 
generality, that f(a) < 0 and f(b) > 0. From the continuity of f, we can find € > 0 
such that f remains negative on [a,a + £) C [a, b]. Let us define the set 


A = {x € (a,b) | f is negative on [a, x) }. 


The set is nonempty, because a +€ € A. Let c = sup A. We will prove that f(c) = 0. 
Let (Cn)n>1 C [a,c) be a sequence convergent to c. From f(c,) < 0, we deduce 
f(c) < 0 and consequently c < b. If f(c) < 0, then from the continuity of f at c, we 
can find ô > 0 such that f remains negative on [c, c + 5). This means that c + ô is in 
A, a contradiction. In conclusion, f(c) = 0.0 


On the other hand, the first proposed problem gives an example of a function 
with IVP which is not continuous. 
Another important property of [VP-functions which we use here is the following: 


Proposition. Each one-to-one IVP-functionf : I > R is strictly monotone. 


Proof. Let us assume, by way of contradiction, that there are a < b < cin I for 
which f(b) is not between f(a) and f(c) (as it would be if f was monotone). One 
immediately sees that we can assume f (c) < f(a) < f(b), without loss of generality. 
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Because f(a) is an intermediate value between f (c) and f(b), we can find d € (b,c) 
so that f (d) = f(a). But this contradicts the injectivity of f, because d # a. O 


Problem. Prove that there are no functions f : R —> R with IVP, such that 
SG) =x, VxER. 


Solution. If x, y satisfy f(x) = f(y), then f(f()) = fA Y)) and further, x = y. 
Thus f is injective. But any injective function with IVP is strictly monotone. 


Consequently, f o f is increasing. This is a contradiction, because f o f = —I1p 
is decreasing. L 


Also, the above proposition implies that any increasing function having IVP is 
continuous. Indeed, such a function has one-sided limits at any point because it is 
increasing, and because it cannot “jump” values, these one-sided limits are identical. 
This is the idea that underlies the following problem, proposed for the Romanian 
National Olympiad in 1998: 


Example. Let f : R — R be a differentiable function such that 
if 7 1 
F@) sf (x+ 


for all x € R and all positive integers n. Prove that f” is continuous. 


Solution. Consider the following function: 


sw =a((re+ 1) -7009). 


By the hypothesis, fn has a nonnegative derivative; therefore it is increasing. Thus, if 
x < y, we have fa(x) < f(y) for all n. By making n —> oo, we conclude that f'(x) < 
f'(y). Thus, f’ is increasing. Using Darboux’s theorem, proved in Chapter 14, as 
well as the observation preceding the solution, we conclude that f” is continuous. C 


We now present a quite challenging problem in which the IVP plays a crucial 
role. Here is problem E3191 from the American Mathematical Monthly: 


Problem. Find all functions f : R —> R with IVP such that 


f@t+y) =fa+fO)) 
forall x,y E€ R. 


Solution. The constant functions are clearly solutions, so we suppose further 
that f is nonconstant. Let a = inf Im f, b = sup Im f, where Im f denotes the range 
of f. The assumption we made shows that a < b. We have f(y) = f(f(y)) for all 
y € R (by setting x = 0 in the given relation); thus f(x) = x for x € (a, b). 

Suppose that a € R. By IVP and f(x) = x for x € (a, b), we must have f(a) = a; 
thus there is d > 0 such that f(x) = x on [a,a+ 2d]. But then any 0 < t < d satisfies 
f(a—t) = a + s for some s > d, because if s < d then 
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a=f(tt+a-nN=f(t+f(a—-f) =f(at+s4+0) =at+st+t, 


a contradiction. On the other hand, f(a — t) > a + d for all t € (0, d) and f(a) = a 
contradict IVP. Thus it is impossible for a to be finite, that is, a = —oo. Similarly, 
b= œ and f (x) = x, for all x € R.O 


The following problem was the highlight of the Romanian National Olympiad 
in 2000. 


Problem. A function f : R? > R is called olympic if for any n > 3 and any 
A1, A2,...,An € R? distinct points such that f(A) = --- = f(An), the points 
A,,A2,...,A, are the vertices of a convex polygon. 

Let P € C[X] be nonconstant. Prove that f : R? > R, f(x,y) = |P(x + iy)| is 
olympic if and only if all the roots of P are equal. 


Solution. If P(z) = a(z — zo)” and f(Ai) = --- = f(A,), then all A; are on a 
circle of center Zo, thus are the vertices of a convex polygon. Now, suppose that not 
all the roots of P are equal and consider z1, z2 two roots of P such that |z; — z2| 4 0 
is minimal. 


: bh Z +z 
Let d be the line containing zı and z2, and let z3 = sl 1 * Denote by s1, 52 the 


half lines determined by z3. By the minimality of |z, — z2|, we must have f(z3) > 0 
and because 


lim f(z) = lim f(z) = œ 
|z|>00 |z|>00 
Si ZES2 


and f has IVP, there exist z4 € sı and z5 € s2 with f (z3) = f(z4) = f(zs), a 
contradiction. O 


Example. The continuous function f : R — R has the property that for any 
real number a, the equation f(x) = f(a) has only a finite number of solutions. 
Prove that there exist real numbers a, b such that the set of real numbers x such that 
f(a) < f(x) < f(b) is bounded. 


Solution. First, we will prove that f has limit at oo and at —oo. Indeed, if this is 
not the case, there exist sequences a, and b, which tend to oo and such that f(a,), 
f (bn) have limit points a < b. This implies the existence of a number c such that 
St (an) < c <f(b,) for sufficiently large n. Using the intermediate value theorem, we 
deduce the existence of c, between a, and b, such that f (ca) = c for all sufficiently 
large n. Clearly, this contradicts the hypothesis. L 


Now, let /; = lim f(x) and h = lim f(x). We have two cases: the first one 
x—> 00 z> 0o 


is when /; Æ hl. Let us suppose, without loss of generality, that J; < h, and let 
us consider real numbers a, b such that 1} < a < b < l}. There are numbers A, B 
such that f(x) < a for x < A and f(x) > b for x > B. Using the intermediate 
value theorem, it follows that a,b € Im (f) and clearly the set of those x for which 
f(x) € [a,b] is a subset of [A, B], thus bounded. The second case is h = h = I. 
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Clearly, f is not constant, so we can choose a < b < lin the range of f if Im (f) is 
not a subset of (l, 00), and we can choose / < a < b in the range of f if Im (f) is not 
a subset of (—oo, l). Now, we can argue as in the first case to deduce that the set of 
those x for which a < x < b is bounded. This finishes the solution. O 


Proposed Problems 


1. Prove that the function f : R —> R, given by 


1 
sin —, x € R \ {0} 
x 


fx) = 
0, x=0 
has IVP. 
2. Let f : [0, 1] — [0, 1] be such that f(0) = 0 and f(1) = 1. Does the surjectivity 
of f imply that f has IVP? 


3. Let f : [0,c0) —> R be a function continuous on (0,00). Prove that the 
following assertions are equivalent: 


a) f has IVP; 
b) there exists a sequence (an)n>1 C (0,00) for which 


lim a, = Qand lim f(a,) = f(0). 
n—>Co n—>Co 


4. Let f : [0, 00) — [a, b] be continuous on (0, oo) such that 


1 1 
li —J=a li =b. 


Prove that f has IVP. 

5. Let f : [a,b] —> R be a continuous function such that f(a)f(b) < 0. Prove that 
for all integers n > 3, there exists an arithmetic progression x; < x2 < +++ < Xn 
such that 


SED +f) +--+ +f Gn) = 0. 
6. Prove that there are no differentiable functions f : R — R, such that 


sinx, x E (—oo, 0) 
cos x, x € [0, co) 


f£'Q)-f@ = 


7. Prove that there are no functions f : R —> R with IVP such that 
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10. 


11. 


12. 


13. 


14. 


fE) = cos? x, Vx ER. 


. Prove that there are no functions f : R — [0, 00) with IVP, such that 


SG) = 2", Vx eR. 


. Find all functions f : R —> R with IVP, such that f(2) = 8, f(—2) = —8 and 


fF) =f), VxER. 


Pays [0, 1] > R be inte rale; Assume that 0 < a < c < b < d< 1 such that 
[ f(x)dx <0 and f f(x) dx > 0. Prove that there exist 0 < a < B < 1 such 


B 
that f f(x) dx = 0. 


Find all functions f : R — R having the intermediate value property such 
that fl! + fll is increasing and there exists such positive integer m that 
fll + fll +... + fll is decreasing. 

Note that fl”! = fofo---of is the nth iterate of f (with fl! = 1p, the identity 
function on the reals) and that we call increasing (respectively decreasing) a 
function h with property that h(x) < h(y) (respectively h(x) > h(y)) whenever 
x<y. 

Let f : R — R be a continuous function and let a,b € f(R) (ie., in the range 
of f) with a < b. Prove that there exists an interval J such that f() = [a, b]. 
Let f : [0, 1] > [0, co) be a continuous function such that f(0) = f(1) = 0 and 
f(x) > 0 for 0 < x < 1. Show that there exists a square with two vertices in the 
interval (0, 1) on the x-axis and the other two vertices on the graph of f. 

Let a1, .. . , an be real numbers, each greater than 1. If n > 2, prove that there is 
exactly one solution in the interval (0, 1) to 


n 


[ [a —x%)=1-x. 


j=l 


Solutions 


1. 


We prove that if Z is an interval, then f(/) is an interval. If zero is not an 
accumulating point of J, then f is continuous on J and consequently, f (I) is 
an interval. 

Obviously, f(R) € [—1, 1], so it is sufficient to prove the equality 


F((O, €)) = [-1, 1], 


1 


for all £ > 0. Indeed, for positive integers n > e~*, we have 
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= > e (0,€) 
5 +2nīr —5 4+ 2nn 


1 1 
(z) =h s(a) TE 


1 1 
Now, using the continuity of f on the interval ‘ , we 
3 da Eee i | 


and 


deduce that f((0, ¢)) = [—1, 1]. 


. The answer is no. The function f : [0, 1] —> [0, 1], given by 


2x, x € [0, 1/2) 


oe Panera 


is surjective but it does not have IVP because 


(G3) -[eJl 


. First we prove that 


1 1 
YneN*, Ja, E€ (0. D), such that |f(a,) —f(0)| < -, 
n n 
where ay = 1. Indeed, if there is nọ € N* for which 


FG) —fO|>—,. vze (o, E), 
no 


no 


then 


FE (-c0.r00 z J U FOU [ro + Z o) l 


1 1 

for all x € |o, 2) . This is a contradiction, because f ([°. ~)) cannot be an 
No no 

interval. 


Reciprocally, if a > 0 and à € (f(0),f(a)), then from lima, = 0, we 
noo 

conclude that there exists nı € N such that a, < a, V n > nı. Because 
lim f(a,) = f(0) < A, there is ny € N such that f(a,) < A, Y n > m. Now, 
n—> 00 
for no = max{n;, m}, we have an <a, f(an) < A. Further, f is continuous 
on [ana] and f (an) < A, f(a) > A, so there exists c € (an, a) C [0, a] such 
that f(c) = À. 
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4. IfI C [0, oo) is an interval having the origin as an accumulating point, then 
(a,b) € f(D C [a,b], 


so f (I) is an interval. To this end, we will use the previous result. Let nọ be a 
positive integer such that 


(=) < f(0) =f (=> r) TEO 


1 1 1 
Note that f is continuous on | ———, — |, so we can find a, € | ———, — 
2n+1 2n 2n+1 2n 
for which f (a„) = f(0). In conclusion, a, —> 0 and f(a,) —> f (0), as n —> oo. 
5. Assume that f(a) < 0 and f(b) > 0, without loss of generality. Let € > 0 be 


such that 


fx) <0, Vxelaate], f(x) >0, Vxe [b-«e,)] 
and let us consider the arithmetic progressions 
ay <a. <+++<a,€[aate], by < b<- <b, € [b-«,D]. 
Define F : [0, 1] > R by the formula 
F(t) = X F(C = tax + thy), t € [0,1]. 
k=1 
The function F is continuous and 
FO) =X fa) <0, FO) = >> f(b) > 0, 
k=1 k=1 

so we can find t € (0, 1) such that F(t) = 0, i.e., 

XOA = tax + thy) = 0. 

k=1 


Finally, we can take the arithmetic progression x, = (1 — t)ag + Ty. 
6. If we assume by contradiction that such a function exists, then define 
g : R > R, by g(x) = e™f (x). Its derivative 


e™ sinx, x € (—oo,0) 
e™ cosx, x € [0, 00) 


R —f@) = 
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has IVP. This is impossible, because g’ has at zero a discontinuity of the first 
kind, 


li i = li X si = (0) 
mhi g (x) lime "sinx 
and 


lim g'(x) = lim e™ =1. 
Tere) lim e™ cosx 


. From f(cos? x) = cos?(f(x)) € [0,1], x € R, we deduce that y € [0,1] = 


f(y) € [0, 1]. This fact allows us to define the function g : [0,1] — [0, 1], by 
g(x) = f(x), x € [0,1]. Obviously, g(g(x)) = cos? x.g is injective and g has 
IVP, so it is strictly monotone. Therefore, g o g is increasing, a contradiction. 


. The function f is injective and f has IVP, so f is continuous and strictly 


monotone. If f(x) = a, for some real a, then 
FF(@)) =f@) > 2% =a, 
which is impossible. If f(x) < x, for all real numbers x, then 
SEQ) <fa) <x> 2% <x, VxER, 


a contradiction. Consequently, f(x) > x, for all real numbers x and further, f is 
increasing. The limit? = lim f(x) does exist. 
x—>—OO 
Because f(x) > 0 for all x, we have / € R and f(/) = 0. 


Finally, from the monotonicity, we conclude that f(/ — 1) < 0, a contradic- 
tion. 


. Im f is an interval and f(y) = y’, for all real numbers y in Im f. Indeed, if 


y = f(x), for some real x, then 


FO =fF@) =PO Sy. 


We prove that Im f = R and so f(x) = x°, for all real numbers x. To this end, 
define u; = —2, vı = 2, 


3 3 
Unt1 = Up, Unti=V,z, n=l. 


It is easy to see that u, — —oo and vn —> œ as n — ov. Finally, note that 
Un, Un E Imf for all positive integers n. Indeed, if u, € Im f, un = f(a), then 


Unyi = u, = f (œ) = f(f(@)) € mf. 
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10. The function F : [0,1] > R, given by 


11. 


(1-1)b+1d 
ro= f f(x) dx, t € [0,1] 


—t)a+te 


is continuous, and consequently it has IVP. According to the hypothesis, 


b 
F(0) =| f(x) dx <0 


and 


d 
F(1) = I f(x) dx > 0, 


so F(t) = 0 for some t in (0, 1). More precisely, 


(1—t)b+td 
f(r) = Í Hiasi, 


1—t)a+tc 


so we can choose a = (1 — t)a + tc, P = (1 — t)b + td. 
Let f, = f +f +...4 f” for every n > 0. Thus, the problem gives us that, 
for all x,y € R, with x < y, we have fi (x) < fi(y), and fin(x) = fn). 

First, f is injective. Indeed, let x, y be real numbers such that f(x) = f(y), and 
assume, without loss of generality, that x < y. Clearly we have f(x) = f") 
for all n > 1; thus f(x) > fa) becomes fll(x) > fll(y), that is, x > y, and, 
consequently, we have x = y. Being injective and with the intermediate value 
property, f must be strictly monotone, which implies that f?! = f o f is strictly 
increasing; therefore, f”! is strictly increasing for all n > 0. Thus 


fin =f th ofh te +f of H phn 


is strictly increasing, while 


fant =A thi o fP +e tf; o fP 


is increasing. 

Because f!”! is decreasing, it follows that m is odd, and f”! is constant (being 
decreasing and increasing at the same time). But, clearly, this can happen only 
if fi is in turn constant. This means that there exists some c € R such that 
fix) = c > f(x) = —x + c for all x € R, and, indeed, the reader can easily 
check that these functions satisfy all conditions from the problem statement. 

This was proposed by Dorel Mihet for the Romanian National Mathematics 
Olympiad in 2014. 
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There exist real numbers p and q such that f(p) = a and f(q) = b. Of course, 
p # q, and we can assume, without loss of generality, that p < q. The set of 
those x in [p, q] for which f(x) = a is bounded and nonempty (it contains p); 
thus it has a supremum, and let it be œ. The set of those x € [a,q] for which 
f(x) = q is bounded and nonempty (it contains q); thus it has an infimum, and 
let it be $. By passing to the limit, and using the continuity of f, we see that 
f(a) = a, and f(B) = b. We claim that f([a, B]) = [a, b]. 

The inclusion [a, b] € f([a, B]) follows immediately from the fact that f has 
the intermediate value property. Conversely, let us prove that f([a, B]) © [a, b]. 
Let x € [a, £], and suppose that f(x) < a, so that we have f(x) < a < b= 
f(B). Again by the intermediate value property, we obtain the existence of some 
y € (x, B), with f(y) = a, but this contradicts the supremum property of a. 
Similarly, the assumption that f(x) > b (and thus f(x) > b > a = f(a)) leads 
to a contradiction of the infimum property of 6. It remains the only possibility 
that f(x) belongs to [a, b], finishing the proof. 

This problem was also proposed for the 1 1th grade in the Romanian National 
Mathematics Olympiad, in 2007. 

The set 


A={xe [0,1]: x+f@>2 


is nonempty (it contains 1) and, of course, bounded; hence it has an infimum, 
and let it be c. Since 0+f (0) < 1 andf is continuous, the inequality x+f(x) < 1 
holds for all x in a neighborhood of the origin (and in [0, 1], of course); therefore 
c > 0. Yet we have x+f(x) < 1 for all x € [0, 1], x < c, andx+f(x) > 1 for all 
x € A (and A contains a sequence with limit c), and these relations immediately 
yield c + f(c) = 1. To summarize: there is a c € (0, 1] such that c + f(c) = 1 
and x + f(x) < 1 for all x € [0, c). 

Since f is continuous, it is bounded and attains its extrema on the interval 
[0, 1]. Thus there is a d € [0, 1] such that f(x) < f(d) for all x € [0, 1]; actually 
the conditions from the problem statement imply that d € (0, 1). 

Now let the continuous function g : [0,1] —> [0, 00) be defined by g(x) = 
x + f(x). Since g(0) = 0 < d < 1 = c + f(c) = g(c), there exists a € (0,c) 
such that g(a) = d © a+ f(a) = d (because g has the intermediate value 
property). 

Now consider the function A : [0, c] —> R defined by h(x) = f(x+f(x))—f(@) 
for all x € [0, c] (for which 0 < x + f(x) < 1, thus A is well defined); certainly, 
h is also a continuous function. We have 


h(a) = f(a + f(@)) —f(@) = f(d) —f(@) = 0, 
A(c) = f(e +f) —fO = fA) -—f) = FO < 0, 
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14. 


and also, 


h(d) = f(d + f(d)) -f (d) < 0. 


The last relation holds (at least) if d < c. 

Now, as h has the intermediate value property, there exists y € [a,c] such 
that h(y) = 0 & f(y + f(v)) = f(y), and œ > 0 assures y > 0. However, we 
cannot be sure of the fact that y < 1. If y is exactly 1, then c = 1 follows as 
well; therefore d < c, and we can find another y in the interval [œ, d] with the 
property that A(y) = 0, and in this case, we have y < d < 1. Either way, there 
exists y € (0, 1) such that h(y) = 0, that is, f(y + f(y)) = f(y). 

Finally we see that the quadrilateral with vertices (y, 0), (y + f(y), 0), (vy + 
fO). fv + f(y))), and (y,f(y)) is a square (the first two points are on the 
x-axis; the others are on the graph of f). 

This is problem 11402 proposed by Catalin Barboianu in The American 
Mathematical Monthly. 

We first use (a form of) Bernoulli’s inequality, namely, 1 — x* < a(l — x) for 
a > 1 and x > 0. Thus, for x € (0, 1), we have 


n 


fw = [[q — x“) < ajte a(l — x)" < 1—x, 


j=l 


the last inequality being satisfied if we choose x > 1 — 1/"4/a1 +++ Gn. 

On the other hand, the inequality (1 — sı): (1 — sn) > 1— sı —---- 
Sn (somehow also related to Bernoulli’s inequality) can be proved by an easy 
induction for s;,...,5, € (0, 1). Thus, for x € (0, 1), 


f@) =] Jd —2%) > lax" Ma EE 


j=l 


where the last inequality holds if we choose x < (1/n)!/")), witha = min aj. 
<j<n 


The intermediate value theorem applied to the function x  f(x)—(1—x) shows 
that there is at least one solution in (0, 1) to the equation f(x) = 1 — x. 

Now suppose that the equation f(x) = 1 — x has two solutions x, and x2, 
with 0 < x; < x) < 1, and let g(x) = logf(x) — log(1 — x), and h(x) = 
(1 — x)g' (x). Of course, 0, x; and x. would be solutions for g(x) = 0, as well, 
therefore, by Rolle’s theorem, g'(x) = 0, and consequently, h(x) = 0 would 
have at least two solutions in (0,1) (one between 0 and xı, and one in the 
interval (x), x2)). Finally, this would produce at least one solution in (0, 1) for 
the equation h’(x) = 0 (again by Rolle’s theorem). But we have 


h(x) = — oa! — x) = x9)! + 1, 


j=l 
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and 
h(x) = — ax 4 —l-a(x-1))d =o) ee 
j=l 


hence we see that h’ is negative in (0, 1), according to the same inequality of 
Bernoulli mentioned in the beginning. The contradiction thus obtained finishes 
the proof. 

This is problem 11226, proposed by Franck Beaucoup and Tamas Erdélyi in 
The American Mathematical Monthly 6/2006, with the solution of the Microsoft 
Research Problem Group in the same Monthly, 1/2008. Yet another (a bit more 
complicated) solution can be found in Chapter 14 about derivatives and their 
applications. 


Chapter 12 
The Extreme Value Theorem 


The extreme value theorem asserts that any continuous function defined on a 
compact interval with real values is bounded and it attains its extrema. Indeed, let 
us consider a continuous function f : [a,b] —> R. If we assume by contradiction 
that f is unbounded, then for each positive integer n, we can find an element x, in 
[a,b] such that |f(x,)| > n. In this way, we define a bounded sequence (xn)n>1- 
The Bolzano-Weierstrass theorem implies (due to the compactness of [a, b]) the 
existence of a convergent subsequence (xk,)n>1. We have |f(xx,)| > kn, for all 
positive integers n. In particular, the sequence (f(x;,))n>1 is unbounded. This is a 
contradiction, because the sequence (f (xk,))n>1 is convergent to f(/), where / is the 


limit of the sequence (x;,)n>1. Then suppose that M = sup f(x) is the least upper 
x€[a,b] 
bound (or supremum) of f (which is finite, as shown above). There exists a sequence 


(Un)n>1 © [a,b] such that lim f(u,) = M. By Bolzano-Weierstrass theorem again, 
7 n—>>Co 


(Un)n>1 has a convergent subsequence; let us call (ux, )n>1 this subsequence, and let 
c € [a, b] be its limit. We then have M = lim f(u,) = lim f(ux,) = f(c), and thus 
n=O n—>Co 
f attains its supremum M. In a similar manner, we show that f attains its infimum, 
thus finishing the proof of the theorem. O 
The fundamental reason for which the above proof works lies in the fact that a 
continuous function carries compact sets into compact sets. Thus if we consider a 
compact metric space (X, d) and a continuous function f defined on X and having 
real values, f will also be bounded and will achieve its extrema, because its image 
will be a compact connected set, thus a compact interval. For the special case 
discussed in the beginning of this chapter, we present another interesting proof. 
Using the continuity of f, for every x in [a,b], we can choose ex > O and an 
open interval Iy = (x — &y,x + Ex) containing x such that |f) —f(@)| < 1, for 
all real numbers y € Z N [a, b]. Further, with the notation M; = |f(x)| + 1, we have 
FO) < M, for all y € I. The family (Z,)xeja,5] is an open cover of the compact set 
[a, b]. Consequently, we can find x1, x2,...,X, € [a, b] such that 


[a, b] C Iy, UL Ue Uk. 
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Now, one can easily see that f is bounded in absolute value by 
max{M,,,M,,,...,Mz,}- 


For the second part of the proof, we proceed as above. Alternatively, one can 

consider the function x +> g(x) = 1/(M—f(x)), with M = sup f(x). If we assume 
x€ [a,b] 

f(x) < M for all x € [a, b], then g is continuous, thus bounded on [a, b], according 

to the first part of the theorem. However, this cannot happen, as there are values of 

f as close to M as we want. O 

Here is a spectacular application of this theorem in linear algebra: 


Theorem. Any symmetric real matrix is conjugate to a real diagonal matrix. 


Proof. Indeed, take A to be a symmetric matrix of order n and define the 
quadratic form g(x) = (Ax,x), where (-) is the standard Euclidean scalar product 
on R”. This is clearly a continuous function in R” with real values. Consider S"7!, 
the unit sphere of R”. This is a compact set, because it is clearly bounded and 
closed. Therefore, q has a maximum À on $”~!, attained at a point v. We will 
prove that A is an eigenvalue of A. Indeed, by the definition of A, the quadratic 
form qi(x) = Allx||? — q(x) is positive and vanishes at v. The Cauchy-Schwarz 
inequality implies b(u, v)? < q,(u)qi(v) for all u, where b is the bilinear form 
associated to qı. Thus b(u, v) = 0 for all u and this implies Av = Av. Thus we 
have found an eigenvalue of A. Because A is symmetric, the subspace orthogonal to 
v is invariant by A, and thus applying the previous argument in this new space and 
repeating this allow us to find a basis consisting of (orthogonal) eigenvectors. Thus 
A is diagonalizable. LI 


If you are still not convinced about the power of this result, consider its use 
in the following proof of the celebrated fundamental theorem of algebra, due to 
D’ Alembert and Gauss: 


Theorem. Any nonconstant polynomial with complex coefficients has at least 
one complex zero. 


Proof. Consider 
P(X) = a,X" + an- 1X"! + +aX +a 


a complex polynomial and suppose that n > 1 and a, 4 0. Let f(z) = |P(z)|. 
Because 


f(@ = larllz|" = lana lz"! — +++ = Jaullzl — laol, 


it follows that lim f(z) = oo. In particular, there exists M such that for all |z| > M, 


lel > 00 
we have f(z) > f(0). Because f is continuous on the compact disc of radius M and 
centered at the origin, its restriction to this compact set has a minimum at a point 
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zo such that |zo| < M. Now, if |z| > M, we have f(z) > f(0) > f(z). Otherwise, 
we know that f(z) > f(zo). Thus f attains its minimum at zọ. Suppose that P does 
P(z + zo) 
P(zo) 
polynomial and g(z) = |Q(z)| attains its minimal value, equal to 1, at 0. Let 


not vanish on the set of complex numbers. Consider Q(z) = . It is still a 


O(z) = 14+ bd +-+ brz, 


where b; # 0. Let b; = re’ with r > 0 anda € R. The triangle inequality shows 
that 


n 
i(t —a) g 7 
lo (ee 7 )| <11- rel + lle, 


k=j+1 


for all € > O such that € < min (1, 1). Thus, for € > 0 sufficiently small, we have 


i(n —a) 
lo (ee j mee 


which is a contradiction. This shows that necessarily P must vanish at some z € C. 
oO 


Let us continue with some concrete examples of application of the extreme value 
theorem: 


Problem. Find all continuous functions f : [0, 1] —> R such that 


fx) = yp’ Y x € [0,1]. 
n=1 


Co 
1 
Solution. Because 5 a 1, any constant function is solution of the equation. 
n=1 
Now let f be a solution, and consider 0 < a < 1 and x € [0, a] such that f(x) = 
mex f. Because x” € [0, a] for all n, we have f(x") < f(x) for all n and because 
0,a 


fo =. 
n=1 


we must have f(x") = f(x) for all n. Take n —> œœ to obtain f(x) = f(0). Therefore 
f(O) > f(@ for all t € [0, a] and all a < 1; thus f(0) = at fe But if f is a solution, 


-f is al lution; thus —f (0) = —f) = —minf. Thus, minf = , and 
-f is also a solution; thus —f(0) mae -f) mune us me ian: an 


f is constant. O 
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Here is a problem from the Putnam Competition: 


Problem. Let/, g : [0,1] — Rand K : [0, 1]<[0, 1] —> R be positive continuous 
functions and suppose that for all x € [0, 1], 


ih, 1 
f= i ked 20) = J K(x. yf dy. 


Prove that f = g. 


Solution. We may assume that 


a = min— <b=min® 
0.11 g [0.1] f 
_ fo) 
and let xo € [0, 1] be such that a = Gay Thus g(x) > af (x) for all x. Also, 
&\X0 


1 
f 60.960) = af) dy = Fl) — ago) = 0 
0 
and because y > K(x, y)(g(y) — af (y)) is a continuous nonnegative function with 


average 0, it must be identically 0. Thus, since K(xo,y) > O for all y, we have 
g = af. But then 


ih, 1 
Hoe i KES sO Se J K(x, )f O)dy = ag(s) = F(x): 


thus a = 1 andf = g. O 


We end this theoretical part with a nontrivial problem taken from the American 
Mathematical Monthly: 


Problem. Let f : R — R be a twice continuously differentiable function such 
that 


2f(x + 1) = f(x) + f(2x) 
for all x. Prove that f is constant. 


Solution. With x = 0 and x = 1, we obtain f(0) = f(1) = f(2). Let F = f”, 
and differentiate the given equality twice to obtain 


2F(« + 1) = F(x) + 4F(2x). 


Thus 
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Now take a > 2 and let J = [—a,a]. Let M = max |f(x)|. Clearly, if x € I, we 
xel 


have 5 € Jand 1+ > € J. Thus 


Folsp+t= 
-M M_ 3M 
4. 4 


3M 
for all x € Z and so M < a hence M = 0. Therefore F is identically 0 and f is 


of the form ax + b. Because f(0) = f(1) = f(2), we deduce that a = 0, and so f is 
constant. L 


Proposed Problems 


1. Let f, g : [a,b] > R be continuous such that 


sup f(x) = sup g(x). 


x€[a,b] x€ [a,b] 


Prove that there exists c € [a, b] such that f(c) = g(c). 
2. Let f, g : [a,b] — R be continuous such that 


inf FG) = inf gorz a g(x) < sup f(x). 


x€[a,b x€ [a,b] 


Prove that there exists c € [a, b] such that f(c) = g(c). 
3. Let f : [0,co) —> R be continuous such that the limit lim f(x) = l exists and 
x00 


is finite. Prove that f is bounded. 
4. Let f : [0, 00) — [0, cc) be continuous such that lim f(f(x)) = oo. Prove that 
xX 0O 
lim f(x) = 
X—>OCO 
5. Let f : R — R be a continuous function which transforms every open interval 
into an open interval. Prove that f is strictly monotone. 
6. a) Prove that there are no continuous and surjective functions f : [0,1] —> (0,1). 
b) Prove that there are no continuous and bijective functions f : (0, 1) > [0, 1]. 
7. Letf : R — R be continuous such that 


f@) —fO)| = lx- vyl 
for all real numbers x, y € R. Prove that f is surjective. 
8. Let : R — R be continuous and 1-periodic. 


a) Prove that f is bounded above and below, and it attains its minimum and 
maximum. 
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13. 


14. 
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b) Prove that there exists a real number xo such that f (xo) = f(%o + 7). 


. Prove that there exists € > 0 such that 


|sinx| + |sin@ + 1)| > e, 
for all real numbers x. Deduce that the sequence 


|sinl| — |sin2| |sin n| 
a= pe SE naz, 
1 2 n 


is unbounded. 
Let f : [0, 1] — [0, 00) be a continuous function. Prove that the sequence 


i 1\" 2\" n\n 
a,=,/fl-) +f- +f (2). 22 
n n n 
is convergent. 


Let f : R — R be a continuous function. Assume that for any real numbers a, b, 
with a < b, there exist c1, c2 € [a, b], with cı < c2, such that 


f(a) = AR (x), fle) = ay (x). 


Prove that f is nondecreasing. 
Let f : R — R be a continuous function. Assume that for any real numbers a, b, 
with a < b, there exist c1, c2 € [a, b], with c1 Æ cz, such that 


f) =f(e2) = mee): 


Prove that f is constant. 

Let f : R — R be a continuous function. Assume that for any real numbers a, b, 
with a < b, there exists c € (a,b) such that f(c) > f(a) and f(c) > f(b). Prove 
that f is constant. 

Let f, g : [a,b] — [a, b] be continuous such that f o g = g of. Prove that there 
exists c € [a, b] such that f(c) = g(c). 

Let (X, d) be a metric space and let K C X be compact. Prove that every function 
f : K — K with the property that 


d(f(x).f(y)) < dy) 


for all x,y € K,x Æ y has exactly one fixed point. 
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1. Let us define the function h : [a,b] —> R by the formula h(x) = f(x) — g(x). 
The function A is continuous as a difference of two continuous functions. By the 
extreme value theorem, we can find a, f € [a, b] such that f(@) = (6) = M, 
where M denotes 


M= sup f(x) = sup g(x). 


x€[a,b] x€[a,b] 


We have f(x) < M and g(x) < M for all real numbers x € [a, b] . Thus 


h(a) = f(a) — g(a) =M— g(a) = 0 


and 


hP) = f(B) — s(B) = f(B) —M <0, 


so there exists c € [a, b] such that h(c) = 0 & f(c) = g(c). 
2. Leta, p € [a,b] be such that 


fla) = int f, f) = sup fC). 


x€ [a,b] 


By hypothesis, f(a) < g(x) < f(B) for every real number x € [a, b]. Define 
the function h : [a, b] > R by h(x) = f(x) — g(a). The function h is continuous 
as a difference of two continuous functions. Then h(a) = f(a) — g(a) < 0 
and h(6) = f(B) — g(B) = 0, so there exists c € [a, b] such that h(c) = 0 > 
Fc) = sg). 

3. For € = 1 in the definition of the limit, there exists 6 > 0 for which |f(x) — I| < 
1, for all real numbers x € [5,00). It follows that |[f(x)| < 1 + |/| for all x € 
[5, 00). Also the restriction f|jo,s} : [0, 5] — R is a continuous function defined 
on a compact interval, so it is bounded, say |f(x)| < M, for all x € [0,6]. 
Consequently, |f(x)| < max{M, 1 + |/|} for all nonnegative real numbers x. 

4. Let us assume, by way of contradiction, that there exists a strictly increasing, 
unbounded sequence (a,),, C [0,00) such that f(a,) < M, for all positive 
integers n and for some real number M. By the Bolzano-Weierstrass theorem, 
we can assume that the sequence (f (an))„>ı is convergent (by working 
eventually with a subsequence of it). If hm. f(an) = L, then by using the 


continuity of f at L, we obtain lim f(f(a,)) = f(L), which contradicts the fact 
n> 
that lim f(f(x)) = oo. 
xX CO 
5. Because any injective continuous function is strictly monotone, it is sufficient 


to prove that f is injective. Let us assume, by way of contradiction, that there 
exist two real numbers a < b such that f(a) = f(b). The function f cannot 
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be constant on [a, b] , because in this case, for every a < c < d < b, the set 

f ((c, d)) = {f (a)} is not an open interval. Thus the restriction f |ja,ġ] attains its 

minimum or maximum on (a, b) , say an f(x) = f(E), for some real number 
x€[a,b 


E € (a,b). In particular, f(x) > f(E), for all reals x € (a,b), so f (a,b) 
cannot be an open interval. This follows from the fact that the set f ((a, b)) 
has a minimum and open intervals do not have minima. 


. a) If such a function exists, then f((0, 1]) = (0,1). This contradicts the fact 


that f ([0, 1]) = [m, M] , where 


m= inf f(x), M= sup f(x), 
xeļo.1] x€(0,1] 


which is a consequence of the extreme value theorem. 

b) Suppose that such a function exists. The function f is continuous and 
injective, so it will be strictly monotone, say increasing. By surjectivity, there 
exists c € (0, 1) such that f(c) = 0. Then for every real number 0 < x < c, 
by monotony, it follows that f(x) < f(c) = 0, which is impossible. 

Alternatively, if a function f has the given properties, then its inverse 
f~! : [0,1] — (0, 1) is continuous and surjective, which contradicts a). 


. Ifx1,%2 € R are so that f(x,) = f (x2), then from the relation 


Fœ) —F2)| = ki — x2]. 0 2 ei — x], 


it follows that xı = x2. Hence f is injective. But f is continuous, so it is strictly 
monotone. For y = 0, we obtain |f(x)| > — [f (0)| + |x], so | pe If (x)| = æ. 
Xx|—>0oO 


For example, if f is strictly increasing, then 
lim f(x) =—oo, lim f(x) = œ 
X—>—00 X00 


and by continuity, f is surjective. 


. a) Observe that because of the periodicity of f, we have f(x) = f (x— [x]) for all 


real numbers x. This shows that the range of f is exactly f ([0, 1]). This is a 
compact interval by the extreme value theorem, so f is bounded and attains 
its extrema. 


b) Let us define the function ¢: R —> R 


wm 


p(x) = f(x + 1) —fO), 


for all real numbers x. The function f is continuous, so ¢ is continuous. If we 
show that the function ¢@ takes values of opposite signs, then the problem is 
solved. Let æ be such that f(a) < f(x) for all real numbers x. The existence 
of such a number follows from a). Then 


ġa) =f(a+n)—fa)=0 
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and 
pla- x) =f(a)—f(a—x) <0. 
9. Let us consider the continuous function f : [0,27] > R, given by 
f(x) = |sinx| + |sin@ + 1)|. 


Let xo € [0, 27] be such that f(x) = min T: We prove that € = f (xo) is 
xE[0, 27 


positive. Indeed, £ > 0, and if £ = 0, then 
|sin xo| + |sin(xo + 1)| = 0 
=> |sinxo| =O and |sin(xo + 1)| = 0. 
We can find integers m,n such that x» = maz, xo + 1 = nz, so 


1 
mr =nr — l > r = —, 
n—m 


which is impossible because the above fraction on the right is smaller than 1 in 
absolute value. Now, for this €, we have 


|sinx| + |sin@ + 1)| > £, Vx € [0, 27]. 


Moreover, this inequality holds for all real numbers x, because sin is 2x- 
periodic. For the second part, note that 


3 [suet (eita 3! [am i.o eal i 
on Neo 2 3 4 


sin(2n — 1 sin 2n 
eh ee) 


2n—1 2n 


|sinl| + |sin2| — |sin3| + |sin 4| |sin(2n — 1)| + |sin 27| 
+ fe a ate 
2 4 2n 


AR Seah E ae fin 
2 4 2n 2 2 n)’ 


and the conclusion follows from the fact that 


1 1 
lim (14 54-42) =o 
noo 2 n 
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One can prove (but the proof is much more involved) that if f is a continuous 
1 2 
va FO 


periodic function such that the series 5 


- converges, then 


f(n) = 0 for all positive integers n. 
We will prove that lim a, = M, where M = me f(x). Assume that f (xo) = 
n—>0oo x€[0,1 


M. Let € > 0. There exists 5 > 0 for which 
f(x) >M—e, Vx € (x — ô, xo + 8) [0, 1]. 


For each n > 1/65, we can find 1 < kọ < n such that 


ki 
2 e Gey 8, xo + 8). 
n 


Then 
5 1\” 2\" n 
a 710 +#(3) PG] 
2G) a(t) 2m 
n n 
We derive 


M—e<a,< %/nM 


for all integers n > 1/6; thus (a,),>1 converges to M. 
Let us assume that f(a) > f(b) for some a < b. Define 


A= {x € [a,b] | fŒ) 2 f@}. 


A is nonempty, a € A, so let c = supA. If (an)n>ı1 C A is a sequence that 
converges to c, then f(a,) > f(a). By taking n > oo, we obtain f(c) > f(a); 
thus c < b. 

If (ra)n>1 C (c, b), then f (ra) < f(a), and moreover, if (rn)n>1 converges to 
c, we deduce that f(c) < f(a). In conclusion, f(c) = f(a). Now, let c < c1 < 
c2 < b be such that 


f(c) = mny (x), f(c2) = mor (x). 


Because f(c) > f(x), for all x in (c,b], we must have cz = c, which is 
impossible. 
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13. 


14. 


15. 


Let M = ma f(x) and define the set 
x€la, 


A = {xE [a,b] | f(x) = M}. 


Let c = inf A; then f(c) = M. We will prove that f(a) = M. Indeed, if f(a) < 
M, then a < c, and the assumption of the problem fails on the interval [a, c]. 
We proved that for alla < b,f(a) = a f(x) and analogously, f(b) = 
x€[a,b 


max f(x). In conclusion, f is constant. 
x€[a,b] 


Let a < band let cı € [a, b] be such that f(c)) = ee f(x). According to the 
x€[a,b 
hypothesis, we can find c2 € (c1, b) such that f(c2) > f(c1) and f(c2) > f(b). 
Thus f(ci1) = f(c2) = nae f(x). The conclusion follows from the previous 
x€la, 

problem. 

We have to prove that the function h : [a,b] —> R given by h(x) = f(x) — 
g(x) has a zero in [a, b] . If we suppose that this is not the case, then, because 
the function h is continuous, it keeps a constant sign on [a,b], say h > 0. 
According to the extreme value theorem, there exists xo € [a,b] for which 
h(xo) = D i h(x) > 0. It follows that f(x) — g(x) > A(x) for all real numbers 

x€l[a,b 


x in [a, b] . Thus 


SFC) = s(g@)) = FF) — sFC))] + F(g@) — sew) 
> h(x) iE h(xo). 


By induction, 
fe) = g) > nho), 


for all real numbers x € [a, b] and all positive integers n. The last inequality 
cannot be true, if we take into account that f and g are bounded (they are 
continuous on a compact interval). 

Ifc, € K,c # c are so that f(c) = c, and f(c’) = c’, then for x = c and 
y = C, we obtain 


d(f(c), f(c)) < d(c,c) & d(c, c) < d(c, c’), 


a contradiction that proves the uniqueness of a fixed point of f (if any). Let us 
define the application ¢ : K — R by the formula ¢(x) = d(x, f(x)) for all 
x € K. The function ¢ is continuous, because f is continuous. According to the 
extreme value theorem, there exists x9 € K such that ¢ (xo) = min (x). Hence 
d(xo,f(xo)) < d(x, f(x)) for all x € K, and we will prove that f(xo) = xo. If 
f (xo) Æ xo, then by hypothesis, 


af (x0), fE (X0))) < dxo, f (x0)) + PF %0)) < Po), 


which contradicts the minimality of ¢ (xo). 


Chapter 13 
Uniform Continuity 


We say that a function f : D C R — R is uniformly continuous if for all € > 0, 
there exists v(e) > 0 such that: 


x,x’ eD, 


x—-x|<v(e) > |f@-f@)| <e. 


Note that this is equivalent to the fact that for all € > 0, there exists v(e) > 0 such 
that: 


x, x €D, 


x= x'| < v(e) > Fœ -= fœ)| <E. 


Also, it is easy to deduce that every uniformly continuous function is continuous, 
but the converse is not always true, as we can see from the following example. 


Example. The function f : R > R, f(x) = x? is continuous, but it is not 
uniformly continuous. 


Proof. Iff is uniformly continuous, then |f (x) —f(»)| < €, for all real numbers 
x,y, with |x— y| < v(e). In particular, for € = 1, we can find v > 0 such 
that |f(x) —f(y)| < 1, for all real numbers x, y, with |x — y| < v. Consequently, 
FO + v) —f(y)| < 1, for all real numbers y. Therefore 


(y+v)y-y <1 |Qvyt+v7|<1 


for all y, which is clearly impossible. L 
The following theoretical result is a criterion for uniformly continuous functions. 
Proposition. Let f : D C R —> R. The following assertions are equivalent: 


a) fis uniformly continuous. 
b) for all sequences (Xn)n>1, (Yn)nz1 C D with lim (xn — yn) = 0, we have 
noo 
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lim (f(%n) —fOn)) = 0. 
n—>Co 


Proof. a)=b). Let e > 0. The uniform continuity of f implies the existence of 
v(e) > 0 such that 


lf@) -fO < e, (13.1) 


for all real numbers x, y € D, with |x — y| < v(e). Because lim (xn — yn) = 0, there 
noo 


exists a positive integer n(e) such that |x, — y,| < v(e), for all integers n > n(e). 
Using (13.1), we derive |f (xn) —f(vn)| < £, for all n > n(e), so 
Him Fn) —fOn)) = 0. 


b)= a). Assuming by contradiction that f is not uniformly continuous, we deduce 
the existence of a real number £ọ > 0 so that 


Vv>0, Jx, Yv: [xs = yyl <v, Fæ) —fOv)| = £0. 


Ifv = 


ale 


,n > 1, then we can find sequences (Xn)n>1, (Yn)n>1 for which 


1 
Xn —Yal < a and [f(%n) —fQn)| = £0, 


for all positive integers n. This contradicts lim f(x) —f(,)) = 0.0 
noo 


We recall that f is a Lipschitz function if 


f@) -fO s Ll|x- yl, 


for all x,y in D and some positive real L. It is immediate that every Lipschitz 


function is uniformly continuous: for each € > 0, it is enough to take v(e) := 5. 
One method to recognize a Lipschitz function is to verify that it has a bounded 


derivative. In this case, if sup |f’(x)| = L < oo, then, using Lagrange’s mean value 
xED 
theorem, 


®© -SOI = [FON x- yl s L- |x- yl. 


As a direct consequence, a differentiable function with a bounded derivative is 
uniformly continuous. 

We have seen that uniform continuity implies continuity, but the converse is not 
true. However, we have the following useful result. 


Theorem. Jff : D — R is continuous and D is compact, then f is uniformly 
continuous. 
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Proof. Assuming the falsity of the conclusion, we can find ¢9 > 0 and sequences 
(Xn)n>1; On)nz1 for which (Him On — yn) = 0 and |f(%)—fOn)| = £0, for all 
integers n > 1. 

Because of compactness of D, the sequence (x,),>1 has a convergent subse- 
quence denoted (xk, )n>1. Because jim (Xn — Yn) = 0, the sequence (yg,)n>1 is also 
convergent to the same limit, say a. The inequality |f (xx,) —f(%,)| = £0, therefore, 
contradicts the continuity of f at a. 0 


This result is extremely useful in a variety of problems concerning continuous 
functions defined on compact spaces. Here is a nontrivial application: 


Problem. Let U be the set of complex numbers of absolute value 1 and let 
f : [0,1] —> U be a continuous function. Prove the existence of a real continuous 
function g such that f(x) = e'8 for all x € [0, 1]. 


Solution. We will begin with a definition: we will say that f has a lifting if there 
exists such a function g. First of all, we will prove the following. 


Lemma. Any continuous function f which is not onto has a lifting. 


Indeed, we may assume that 1 is not in the range of f. Then, for all x, there 
exists some g(x) € (0,2m) such that f(x) = e'8°). We claim that this function g 
is a lifting of f; that is, g is continuous. Indeed, fix a number xo and suppose that 
g is not continuous at xo. Thus, there exists a sequence a, converging to xo and an 
€ > 0 such that |g(a,) — g(xo)| = €. Because the sequence (g(a,)) is bounded, it has 
a convergent subsequence, and by replacing eventually a, with the corresponding 
subsequence, we may assume that g(a,) converges to some / € [0,27]. But then 
f (dn) converges to e", and by continuity of f, we must have e” = e'80), which 
means that / — g(xo) is a multiple of 27. But since g(xo) € (0,27), this forces 
l = g(xo), and thus we contradict the inequality |g(a,) — g(xo)| > € for sufficiently 
large n. Thus g is continuous and f has a lifting. 0 


Now, we will employ the uniform continuity of f. Take N sufficiently large 
such that 


(Se) <1 


for all x and all 1 < k < N. This is possible, since the maximal distance between 
foe and Ke (taken over all x) tends to 0 as N — oo and since f is uniformly 
continuous. But then, if we define 


hy (x) = 


216 13 Uniform Continuity 


these functions take their values in U and are not onto, since —1 is not in their range. 
Thus, by the lemma, they have liftings g1, g2,..., gn. By taking some a such that 
f(0) = e", we observe immediately that a + g1 +++: + gy isa lifting for f. 0 

We continue with an easier problem. However, it requires some geometric 
interpretation: 


Problem. Let f : [0, 00) — [0, 00) be a continuous function such that 


f G + ”) gt Oat) 
2 2 


for all x, y. Prove that f is uniformly continuous. 


Solution. First, we are going to prove that f is concave. Indeed, an immediate 
induction shows that 


G +a +e +) fr) +f 2) +++ +f (2) 
f on ee n 


for all nonnegative numbers x1, x2, . . . , X2» and all positive integers n. Thus, by fixing 
some 0 < k < 2” and two nonnegative integers x, y, we deduce that 


k k k k 
sš (1-5)y) = See + (1-5) s00 


But, as we have seen in the chapter concerning density, the set of numbers of the 
form $ with 0 < k < 2” — 1 is dense in [0, 1] and the continuity of f implies the 
inequality 


flax + (1 — a)y) = af(x) + -af O) 


for all x,y > 0 and all a € [0, 1]. That is, f is concave. 

Now, take some sequences a, and b„ such that a, —b, converges to 0 and a, < bn 
for all n and suppose that f(a,) — f(b,) does not converge to 0. Thus, a, — oo 
(otherwise, it would have a convergent subsequence, and the same subsequence of 
b, would converge to the same limit, thus contradicting the continuity of f). Take 
some c > d > | and observe that the concavity of f implies the inequality 


Ave 
fbn) =F (an) < (bn — an) ae 


Now, let us prove that f is increasing. This is quite clear, because for any a > 0, the 
function 


_ fe) -fa 


X—a 


ha (x) 
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is decreasing for x > a (since f is concave) and hg(x) > 2, the last quantity 
converging to 0 as x — oo. Thus h, must be nonnegative on (a, oo), and so f is 
increasing. Finally, this argument combined with the previous inequality gives the 
estimation 


d)— 
0 < f (bn) — Flan) £ (bn -a OO. 


which shows that f(b,) — f (an) converges to 0. Using the theoretical results, we 
finally deduce that f is uniformly continuous. L 


The following problem is much more challenging: 


Problem. Let J C R be an interval and let f : J —> R be uniformly continuous. 
Define 


d(e) = sup{d > 0 | Vx1,%2 E I, |x, —x2| < 6 > |f 1) —f@2)| < £}. 


Prove that: 


a) For all £ > 0 and all x,y € J, 


If) —f0)| < io +e. 


ô 
b) lim da = 0 if and only if f is not Lipschitz on 7. 
Ee—> E 
Solution. a) Let n = eu Everything is clear if n = 0, by definition of 
E 


6(€), so assume that n > 1. Also, suppose that x < y. Let 
xı =x + d(€), x2 = x + 28(€),..., Xn = x + nô(e). 
Then 
If) —fadl s e, Fæ) -fel < e.. En —fO) S e, 
thus 


|x — yl 


ie) EFE. 


If) -fO n+ Ie < 


ô 
b) Let us suppose that f is not Lipschitz and assume that lim ae is not 0 (it may 
E~ g 


E og ; ‘ 
not exist); thus lim HO is not oo, so there is a decreasing sequence £, —> 0 and 
e>0 E 
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En 


5(E n) 


areal number y such that 


—>y.LetC= SUP 5) ‘is Then by a), we 


have 
f(x) -—fO)| Ss C|x— y| + En, Vn, Vxyel. 


For x,y € I fixed, let n —> oo. We obtain |f(x) — f(y)| < Clx — y|; thus f is 
C-Lipschitz, a contradiction. 


Now, suppose that lim O = 0, but f is Lipschitz. Let C > 0 be such that 


f(x) Ff ()| < Clx—y| Sais € I. Take £ > 0. ase x € I with |x; —x9| < a 


ô 
we have |f (x1) —f(2)| < £, thus 6(e) > : > d > This contradicts the fact 
E 
that lim © = = 0.0 


e>0 


We continue with another quite difficult exercise: 
Problem. Let f, g : R — R be continuous functions. Suppose that f is periodic 


ax) 
x 


and nonconstant, and lim = oo. Prove that f o g is not periodic. 
x00 


Solution. Suppose the contrary. Then by proposed problem 13, fog is uniformly 
continuous. Let T > 0 be a period for f and u, v € [0, T] be such that f (u) = miaf, 
0.7 


M— 
fa) = mary. Let f(u) = m, f(v) = M, and £ = —— By uniform continuity, 
0.7 


there is 5 such that 


kyl <8 > fe) —f(e0))| < (13.1) 


We claim that there is an interval of length 6 such that the variation of g on this 
interval is greater than T. Indeed, otherwise 


eo < 18| + 18) = g)| +--+ + Ig(8) — g(@— 04) _ Is] T 
+ 
nô |7 nô ~ nô ô 


and this contradicts the hypothesis made on g. 

Thus, there are x1, x2 € R with |x; — x2| < 6 such that |e (x1) — g(x2)| > T. Thus 
there are a, b between g(x,) and g(x2) such that f(a) = m, f(b) = M. By continuity 
of g, there are x, y between x1, x2 such that a = g(x), b = g(y). Thus |x— y| < 6 and 


—m 


FEE —f(8O))| = Fa) —fO)| = M—m> ae 


which contradicts (13.1). This finishes the solution. O 
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Proposed Problems 


13. 
14. 


15. 


. Prove that the function f : (—1, 1) > R,f(x) = x? is uniformly continuous. 


. Prove that the function f : (0,c0) —> R,f(x) = sin- is not uniformly 
x 
continuous. 
. Prove that the function f : (0,c0) > R,f(x) = sin (x?) is not uniformly 


continuous. 


. Letf,g: R > R,f(x) = x, g(x) = sinx. Prove that f and g are uniformly 


continuous, but their product fg is not uniformly continuous. 


. Let f : DC R— R be a function with the property that 


Ifo) —fO)| S vix—yIl, 


for all x, y € D. Prove that f is uniformly continuous. Is this condition necessary 
for uniform continuity? 


. Let f : D C R — R be uniformly continuous and bounded. Prove that f? is 


uniformly continuous. 


. For what « is the function f : [0, co) > R, f(x) = x” uniformly continuous? 
. Prove that the function f : (a,oo) > R,f(x) = Inx is uniformly continuous if 


and only ifa > 0. 


. Does there exist a uniformly continuous function that is not a Lipschitz 


function? 


. Let f : [0, 1) — R be uniformly continuous. Prove that the function f has finite 


limit at 1. 


. Let f : R — R be continuous, having horizontal asymptotes at +00. Prove that 


f is uniformly continuous. 


. Let f, g : [0, 00) — R be continuous such that 


lim [f() — gw] = 0. 


Prove that f is uniformly continuous if and only if g is uniformly continuous. 
Let : R — R be continuous and periodic. Prove that f is uniformly continuous. 
Let f : R — R be uniformly continuous. Prove that 


œ| < a|x| + b, 


for all x and some a, b. 
Let f : R — R be a uniformly continuous function. Prove that the function 
g : R? > R, given by 


wf (=).x#0 


; x=0 


g(x,y) = | 


is continuous. 
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Solutions 


1. Let (xn)n>1 » @n)nz1 be sequences with elements in (—1, 1) such that 
lim (x, — yn) = 0. 
noo 
Then 


Fn) —fOn)| = [xr = yal = [Xn — Val + On + Yn) S 2 [Xn — Ynl - 


We used the inequality |x, + y,| < 2, which follows from x, yn € (—1,1). 
Now, from the inequalities 


0 < F@n) —fOn)| < 2 [xn — yal, 


we derive lim (f(%) —f(n)) = 0. Hence f is uniformly continuous. 
n—>Co 


In fact, the uniform continuity of f can be established by using the fact that f 
is differentiable with bounded derivative, |f’(x)| < 2 |x| < 2, forall xin (—1, 1). 
On other hand, we can easily see that if a function is uniformly continuous 
on D, then every restriction f|p,, with Do C D remains uniformly continuous. 
In our case, the given function f can be considered as a restriction of the 
function F : [—2,2] —> R, given by the law F(x) = x’. Finally, F is uniformly 
continuous on the compact set [—2, 2]. 
2. We can take the sequences 


Easily, lim (xn — Yn) = 0 and 
n—>>Co 


Him Fn) —fOn)) = hm (sin 2nr — sin (2nx E =)) zd; 


so f is not uniformly continuous. 

This proof uses sequences in a neighborhood of the origin, so a natural 
question is if the restriction g = f|{a,oo), With a > 0, is uniformly continuous or 
not. In this case, we have 


[cos +| 1 1 
acer í 


mA 
ls (x)| a F 


for all x in [a, o0). Hence fliaoo) is uniformly continuous, because it has 
bounded derivative. 
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3. First, note that the derivative f'(x) = 2xcosx? is bounded on intervals of the 
form (0, a], a > 0, so f |(0,a) is uniformly continuous. Indeed, 


Fœ = 2x |cos x°| < 2x < 2a, 
for all x in (0, a]. On (0, 00), we can consider the sequences 


Xn = V2nt, yn = V2nn—1/2, n>. 


We have 
lim (x, —y,) = lim (v2nx — J2nx — x/2) 
n>oo noo 
R 1 
= — lim =0 
2 n> /Inn + \/2nn — 1/2 
However, 


Him Gn) —fOn)) = jim (sin 2nx — sin (2nx — =)) =1, 


with lim (x, — yn) = 0. Thus, f is not uniformly continuous. 
n—>Co 


4. The functions f and g are uniformly continuous, because they have bounded 
derivatives on R, 


’@|]=1, |e’@)| =|cosx}<1, xeER. 


The product function h(x) = xsin x, x € R, is not uniformly continuous. 
Indeed, let us consider the sequences 


1 
X, = 2AN; Yn = 2AT — 7, n>l. 


We have lim (x, — y,) = 0 and 
n—>>Co 


i= tO = (20x _ =) wn (:) 


converges to 277. Hence h is not uniformly continuous. 
5. Let (%n),>1 > Øn)n>ı C D be sequences such that lim (x, — yn) = 0. 
= = n—>Co 


From the inequality 


lf (Xn) =f Yn) < y Xn = ynl, 
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it follows that 
lim [f(%n) —fOn)| = 9, 
n—>Co 


so f is uniformly continuous. Clearly, there are uniformly continuous functions 
which do not satisfy this condition, for example, f(x) = x. 


. Let us assume that for some M > 0, we have |f(x)| < M, for all x € D. We 


then deduce that 


IP?) —F?O)| = rO OLO —FO)| 


so 
I?) —f70)| < 2M - If) -Fo 
Now let (Xn),>1+(n)n>1 C D be any sequences such that 
lim |x, — yn| = 0. 
n—>>Co 
The function f is uniformly continuous, so lim [f(%,) —f(yn)| = 0. Hence 
n—>Co 
lim TAEA) -= F On) | =0, 
n—>>Co 


and the function f? is uniformly continuous. 


. First of all, let a > 1. If f is uniformly continuous, then there is v > 0 so that 


x,y € [0,00), |x- y| < v > hx -y| <1. 
In particular, 
|x + v)* —x*| <1, 

for all nonnegative reals x. This contradicts the fact that 

lim [(x + v)* —x*] = œ, 

Xx—> 0O 
as follows immediately from Lagrange’s mean value theorem. 

For 0 < œ < 1, we can see that f is uniformly continuous on [0, 1] 


(continuous on a compact set) and f’ is bounded on [1, 00). It follows that f 
is uniformly continuous on [0, oo). 
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8. 


10. 


11. 


First assume that a > 0. The function f is uniformly continuous because it has 
bounded derivative, 


1 1 
f@= a < Fi 


for all x € (a, co). In the case a = 0, consider the sequences 


1 1 
Xn = -, Vn Gane. 
n n 
We have 
; . 1 
lim (xn — yn) = lim | --— >] =0 
noo n>o\n n 
and 


1 1 
lim (Qn) —fOn)) = lim (in ——In =) = lim lnn = œ, 
noo noo n n noo 


so f is not uniformly continuous. 


. Yes. An example is f(x) = ./x, x € [0, 00). As we have already proved, f is 


uniformly continuous. We have 


f@)-fO)| _ 
Sina UR 
x—y xy 


Vx = Y 
su — | = su 
; xy | 


x#y 


1 
—.———_ |] = w, 
Tt 


so f is not Lipschitz. 

First of all, it is clear that if (x,),+; converges to 1, then (f(%,))n>1 is a 
Cauchy sequence. Indeed, for € > 0, take v such that |x — y| < v implies 
If (x) —f(y)| < e€. Then (because (x,,),>1 is a Cauchy sequence) for sufficiently 
large m,n, we have |x», — X,| < v, thus |f@,) —f(%m)| < € for all sufficiently 
large m,n. Thus (f(%,))n>1 is a Cauchy sequence, thus convergent. We claim 
that all such sequences have the same limit. This is easily seen by considering 
the sequence x1, y1, X2, y2,... . Call its terms z),2Z,.... Then we know that 
(f Zn))n>1 converges. Since (f(xn))n>1 and (fn))n>1 are subsequences of 
(f Zn))n>1, they must have the same limit. This shows that (f(x,,)),>1 converges 
to a certain / for all (x,),>1 convergent to 1. Thus f has a finite limit at 1. 

Let us denote 


a= lim f@), b= lim fQ). 


Suppose that f is not uniformly continuous. By the theoretical part of the text, 
we know that there exist two sequences (an)n>1 and (bn)n>1 such that lim (a,— 
= a noo 


b,) = 0 and € > O such that |f(a,) — f(b,)| > € for all n. Next, (a,)n>1 has 
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a limit point in R U {+00}; that is, for some increasing sequence (k,)n>1, We 
have lim ag, = L. Clearly, lim b, = L 
noo n—> 00 
Now, if / is real, the inequality |f(a,,) — f(bz,)| > € cannot hold for all n 
because f is continuous at l. Suppose for instance that l = oo. Then 


lim f(a) =b = lim f(by,). 
noo noo 
and again we reach a contradiction. 
As a consequence, a continuous function having oblique asymptotes at +00 
is in fact uniformly continuous. Indeed, if 


lim (f(x) — mx — n) = 0, 


with m,n € R, then g(x) = f(x) — mx — n defined on a neighborhood of +00 is 
uniformly continuous, because it has horizontal asymptote y = 0. Finally, 


F(x) = g(x) + (mx +n) 


is uniformly continuous, as a sum of two uniformly continuous functions. 
If we assume that f is uniformly continuous, then 


E 
Ve>0, du >0, xy < vı > F@ -f0 < z- 
Pick M > 0 with the following property: 
E 
x E€ [M, œ) > Fœ) — 8@)l < z- 


The function g is uniformly continuous on [0, M] , so we can find v2 > 0 for 
which 


E 
x,y € [0,M], |x—yl < v2 > |g) — 80) < 5 

Now, with v = min {v1, v2} , we can prove the implication 
x,y € [0, 00), x= y| < v > |g@) — 80) < 8, 


which solves the problem. The last implication is true in case x, y € [0, M]. 
If x, y € [M, oo), then 


Ig) — 80) s Is@) —F@I + F@ —FO)| + IFO) — 80)! 


E E È E 


Solutions 


13. 


If x € [0, M], y € [M, oo), then 


lex) — gQ)| < le) — g()| + |g) — g(M)| 


Cee 
<b Sug, 
Dy 2, 
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This result can be easily extended to continuous functions f,g : R —> R for 


which 


dim Fœ — g@)] = 0. 


As a direct consequence, continuous functions with oblique asymp 
uniformly continuous. Indeed, if 


lim [f(x) — mx- n] = 0, 


totes are 


then g(x) = mx + n is uniformly continuous, and so f is uniformly continuous. 
More generally, the result remains true even if f has different asymptotes at —oo 


and +00. 


Let T > 0 be a period of f. By the uniform continuity of f on the compact set 


[0, T] , we can state the following: 
Ve>0,dv>0, x,y € [0,7], xx- y| <v > F) -—fO)| < 
The value v can be chosen less than T. We prove now the implication 
ny ER, k -yl < v > FO -fO < e- 
IfO < y— x < v, then x,y € [(k— 1)T, kT] or 
xe[(k—1)T,kT], ye [kT, (k+ DT]. 
In the first case, 
FO - FO) < fæ- k- DT) -f0- k- DT) < 5 <e, 
because 
x—(k—1)T, y—(k—1)T e [0,T]. 
Otherwise, 
FO —FO)| FO — FED)| + FO) -FET 
= fæ- k- DT) ~FO)| + Ifo — kT) -f0 


EE 
= a Se. 
aD tad, 


In conclusion, f is uniformly continuous. 


E 
7 


226 


14. 


15. 
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Let v > 0 for which we have the implication 
x,y E R, |x-y| <v => |f@) -fO)| < 1. 


Let x € R, x Æ 0 and let n = [=] + 1, which is a positive integer. Then 


v 


kx (k-Dx 


n n 


’ 


and it follows that 


TOETO 53 (=) -1(= tx) 


<Ons + yO +1, 


This condition is not sufficient for uniform continuity. For example, the function 
f(x) = xsinx, x € R, is not uniformly continuous, as we have proved, but 
If(x)| < |x|, for all x. 

Let a, b be real numbers satisfying |f(x)| < a|x| + b, for all real numbers x. 
Then 


0 < lees = bl y (Z)| < k- (a [E| +4) = a- l+ b- ll. 


Obviously, g is continuous at the origin, and consequently, it is continuous on 
RxR. 


Chapter 14 
Derivatives and Functions’ Variation 


The derivatives of a differentiable function f : [a,b] —> R give us basic information 
about the variation of the function. For instance, it is well-known that if f’ > 0, 
then the function f is increasing and if f’ < 0, then f is decreasing. Also, a function 
defined on an interval having the derivative equal to zero is in fact constant. All of 
these are consequences of some very useful theorems due to Fermat, Cauchy, and 
Lagrange. Fermat’s theorem states that the derivative of a function vanishes at each 
interior extremum point of f. The proof is not difficult: suppose that xo is a local 
extremum, let us say a local minimum. Then f(x» + A) — f (xo) = O for all A in an 
open interval (—6, 5). By dividing by h and passing to the limit when h approaches 
0, we deduce that f’(xo) > 0 (for h > 0) and f’(xo) < 0 (for h < 0); thus f’(xo) = 0. 
Using this result, we can now easily prove the following useful theorem: 


Theorem (Rolle). Let f : [a,b] — R be continuous on |a, b] and differentiable 
on (a, b). If f(a) = f(b), then there exists c in (a, b) satisfying f’(c) = 0. 


As we said, the proof is not difficult. Because f is continuous on the compact 
set [a, b], it is bounded and attains its extrema. We may of course assume that f is 
not constant, so at least one extremum is not attained at a or b (here we use the 
fact that f(a) = f(b)). Thus there exists c € (a,b) a point of global maximum or 
minimum for f. By Fermat’s theorem, f” vanishes at this point. 0 

As a consequence, we obtain the following: 


Theorem (Lagrange’s mean value theorem). Let f : [a,b] — Rbe continuous 
on [a,b] and differentiable on (a,b). Then there exists c in (a,b) satisfying the 
equality 


f(b) -fa = b- af). 


The proof is immediate using Rolle’s theorem: the function 


h(x) = (b— F &) — f(a) — FO) -fE - a) 
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satisfies the conditions of Rolle’s theorem; thus its derivative vanishes at least at one 
point, which is equivalent to the statement of the theorem. L 
In exactly the same way, by considering the auxiliary function 


h(x) = (g(b) — s(@) F(a) — f(a) — FS) —f@)(g@) — 8). 


one can prove Cauchy’s theorem as an immediate consequence of Rolle’s theorem. 


Theorem (Cauchy’s mean value theorem). Let f,g : [a,b] — Rbe both 
continuous on [a, b] and differentiable on (a,b). Assume also that the derivative 
of g is never zero on (a, b). Then g(b) — g(a) 4 0 and there exists c in (a,b) such 
that 


fe) -f@) _ fO 
gŒ) -= g(a) s') 
Note that Lagrange’s theorem appears now to be a particular case of this last 


theorem, and also note that it proves the assertions from the beginning of the chapter. 
Yet another important result is the following. 


Theorem (Taylor’s formula). Let f : I —> R be ann + 1 times differentiable 
function on the interval I. Then, for every xo and x in 1, there exists c € I (depending 
on xo and x and between xy and x) such that 


£0) = PEO Ge ny 4 FOG — ay 
k=0 ` i 


Proof. Let F and G be defined by 


n pk 
FO =f- pe (xo) 


k=0 


(t —x yt 
kl (t — xo) and G(t) = LD 


for all t € J. One immediately sees that F and G are n + 1 times differentiable 
and that F (xo) = G (xo) = 0 for 0 < k < n, while FOD (A = fT) (A) and 
G"*)(t) = 1 for all t € I. Cauchy’s theorem applies to F and G on the interval 
from xo to x and yields the existence of some cı € J with property 


FQ) _ FQ)— Flo) _ Fe) 
CC® GQ) = Glo) Ger)’ 


By Cauchy’s theorem again (for F’ and G’ on the interval from x9 to c,), we obtain 
some cz € J such that 

Fc) _ F'(a)-— F(x) _ F" (c) 

G'a) Ela) — FH) (e) 
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We continue this procedure based on Cauchy’s theorem until we get 


Fœ) -. F' (cı) = F” (c2) ENES FO+D (en41) 
GO Gi) Ge) GED (Eq 41) 
for some c1, C2, ...,Cn+1 € Z (actually between xo and x). Now, with c = c,+1, the 


equality 


Fœ) — FOtve) 
G(x) G@"*D(c) 


= 6%) 


is precisely what we wanted to prove. LI 


So, we can deduce from Lagrange’s theorem the relationship between the 
derivative’s sign and the monotonicity of f. Moreover, the second derivative f” is 
closely related to the notion of convexity. Indeed, f is convex on intervals where f” 
is nonnegative and is concave on the intervals where f” is negative. Recall that f is 
called convex if for all t € [0, 1] and x, y € [a, b], the following inequality holds 


fA- Axt) A-A + FO), 
and it is concave if the reversed inequality holds for all t € [0, 1] and x, y € [a, b]. 
Any convex function satisfies Jensen’s inequality: for each positive integer n, for all 
Q1,...,@, € [0, 1] with sum 1, and for every x1, ..., Xn € [a, b], 


f(x +++ + nxn) < f 1) + +++ + Onf On). 


In particular, 


(===) < f) +- + fn) , 


n n 


for every x1,...,Xn E [a,b]. When the function is concave, each of the above 
inequalities is reversed. 
Let us see now how we can use the previous results in a few concrete problems. 


Problem. Let a), a2,...,an be positive real numbers. Prove that the inequality 
aq +o +e +a >n, 


holds for all real numbers x if and only if ajaz- -an = 1. 


Solution. If the product of all the a; is 1, we have, using the AM-GM inequality, 


x x x X gX Sps nare 
a +o ++ > nyd = ny (aaan) = n. 
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For the converse, we give two approaches. 


Method I. Let us define the function f : R > R, given by 
f@ =a +t Han. 


The given inequality can be written as f(x) > f(0), so O is a point of minimum. 
According to Fermat’s theorem, f’(0) = 0. But 


f' (x) = qj Ina, + a Ina) +--+ + a lna, 


thus Ina; + lna +--+: + Ina, = 0, which is exactly the desired condition. 


Method II. One can easily establish that 


e 1 
_ (aitatet+ay\'" 
lim = 4/7412 *** Ay. 


x0 n 


For x > 0, we have 


x x x\ 1/x 
tate tay y] (14.1) 
7 > : 
and for x < 0, 
x xX xN 1/x 
(A) <1 (14.2) 
P <l. : 


Now, if x N 0 in (14.1) and x Z 0 in (14.1), then we deduce that 2/ajaz---ay, > 1 
and 2/a,da°*+d, < 1, respectively. In conclusion, aja2--:a, = 1.0 


Problem. Solve in R the equation 4* — 3* = x. 
Solution. For each fixed real number x, define the function f : [3,4] > R, by 
fO) =y*, with f’(y) = xy*"!. According to Lagrange’s mean value theorem, 
FA -FO = 4-3f' (0) e #-3 = xc", 


for some c € (3,4). The given equation can be written as xc’! = x. One solution 
is x = 0. For x Æ 0, we derive œ! = 1 & x = 1. In conclusion, there are two 
solutions: x = 0 and x = 1.0 


Lagrange’s theorem also has important applications in transcendental number 
theory. We present here Liouville’s famous theorem, which shows that algebraic 
irrational numbers are badly approximable with rational numbers. This is the very 
first nontrivial result in this active field of research: 
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Theorem (Liouville). Let P be a nonzero polynomial with integer coefficients 
and degree n. Suppose that a is an irrational zero of P. Then there exists a constant 
C > 0 such that for all rational numbers with p,q € Z, and q # 0, we have 


C 
lq\"” 


a-?|> 
q 


Solution. It is clear that there exists 6 > 0 such that the only zero of f in J = 
[a — 5,a + ô] is a. Because P’ is continuous on J, it is bounded, and we can pick 
M such that M > |P’(x)| for all x € J. Using Lagrange’s mean value theorem, 


a— z| zL Ie (2)| for all x = : € I. Because P has integer 


we deduce that af 


is a nonzero integer; thus it is at least equal 


to 1. Thus for all A € I, we have |a — x| > Mar If x does not belong to /, then 


coefficients, the number |g|” | P (2) 


|x — a| > 6. Thus by taking C = min (ô, i) we always have ja- z| > ai" Oo 


The following result is often useful for studying the behavior of a function when 
we know some of its zeros and the behavior of its derivatives. One can also see it as 
a factorization result, similar to the well-known result for polynomials. 


Problem. Let f be a n times continuously differentiable function defined on 
[0, 1], with real values, and let a}, a2,...,a be n distinct real numbers in [0, 1] 
such that f(a;) = f(a.) = --- = f(a,) = 0. Then for all x € (0,1), there exists 
c € (0, 1) such that 


(x — a1) (x — aa) +++ (% — an) 


FQ) = ; FC). 
n! 
Solution. The case when x is one of aj,a2,...,d, being immediate, let us 
assume that x, a1, a2, . . . , an are pairwise distinct. Consider the function 


S(t) = (t= ai)(t— a) +++ (t= an f(x) — œ — a1) œ — az) -+ — an) FO, 


which is n times differentiable and satisfies g(x) = g(ai1) = g(a2) =--- = g(an) 
= 0. Let bọ < bı < +++ < bn be the numbers x,a1,...,a when arranged in 
increasing order. By Rolle’s theorem, g’ has at least n zeros, one in each (b;, bj+1). 
Applying again Rolle’s theorem and repeating the argument, we deduce that g”) 
vanishes at a point c € (0, 1). Thus 


nlf (x) = E- a) E- aa) æ an) f (0), 


which is exactly what we needed. L 


We continue with a particularly nice and surprising property of derivatives. 
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Theorem (Darboux). The derivative of a differentiable function has the inter- 
mediate value property. 


Proof I. The proof is not difficult at all. Take f a differentiable function and 
suppose that a < b and f'(a) < f'(b), for instance. Consider y € (f’(a),f’(b)). We 
need to prove that there exists x € (a,b) such that y = f'(x). Let g(t) = f(t) — yt. 
Then g’(a) < 0; thus there exists a’ > a such that g(a’) < g(a). Similarly, there 
exists b’ < b such that g(b’) < g(b). Because g is continuous on [a, b], it attains its 
minimum at a point x. The previous remarks show that x € (a, b). Thus by Fermat’s 
theorem, we have g'(x) = 0, that is, y = f'(x). 0 


Proof II. (See the note A New Proof of Darboux’s Theorem by Lars Olsen in 
The American Mathematical Monthly, 8/2004). Let (for c € [a, b]) he be the function 
defined for all x € [a, b] by 


fc), for x=c. 
Clearly, any he is continuous on the interval [a, b], and we have h,(a) = f'(a), 
h,(b) = f'(b), and ha(b) = h(a). Thus, if y is between f'(a) and f'(b), it is also 
either between h,(a) and h,(b) or between h,(a) and h,(b). Suppose we are in the 
first case. By the continuity of ha, we can find c between a and b such that y = 
h,(c) = (f(c) — f(a))/(c — a). But Lagrange’s mean value theorem ensures the 
existence of some x between a and c such that (f(c) — f(a))/(c — a) = f'(x); thus 
y = f'(x), and we are done. 

When y is between h,(a) and h,(b), we proceed analogously, by using the fact 
that h,, being continuous, has the intermediate value property. L 


We now present a very useful criterion for proving that a function is continuously 
differentiable k times. It is also a consequence of Lagrange’s theorem: 


Proposition. Let I be a nontrivial interval of the set of real numbers, xo € I and 
f : 1 — R continuous on I, differentiable at all points of I except xo and such that 
f' (x) has a finite limit when x tends to xo. Then f’ (xo) exists, and it is equal to this 
limit. 

Proof. Let 5,¢ > 0 be such that |f'(x)— 1| < e forallx € V = (xo—ô8, xo + 5), 
x Æ xo. From Lagrange’s theorem, for all x € V, x > x9, we have the existence of 
some c(x) between x and xo such that Hurtin = f'(c(x)). Therefore 


F(x) =f xo) 


X — Xo 


k- E: 


A similar argument shows that the last relation actually holds for all x € V, x Æ xo. 
This shows that f'(xọ) = 1.0 
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Problem. Let x) € R and € > 0. Prove that there exists an infinitely many times 
differentiable function f : R — R, not identically zero and which vanishes outside 
(xo — €, xo + €). 


Solution. First of all, consider the function g(x) = ei for x > 0 and 0 
otherwise. We claim that this function is infinitely many times differentiable on R. 
This is clear at all points except 0. Using the above proposition, it is enough to prove 
that g has a limit equal to 0 at 0 for all k. An inductive argument combined with 
the proposition would finish the proof. But it is very easy to prove by induction that 
g(x) equals 0 for x < 0 and Pibe: if x > 0, where P; is a polynomial with 
real coefficients and degree 2k. This shows that g(x) tends to 0 as x tends to 0 
(from the right), and so g is indeed infinitely many times differentiable at 0 and thus 
everywhere differentiable. Now, consider the function 


f(a) = gle? — (x — x0)°). 
This is also an infinitely many times differentiable function and, clearly, it is not 


identically zero. However, f(x) > 0 implies €? > (x—xo)”, that is, x € (xo—€, xo +€). 
This shows that this function is a solution of the problem. O 


We continue with some problems taken from the Putnam Mathematical Compe- 
tition. 


Problem. Let f, g : R — R with f twice differentiable g(x) > 0 for all x and 


F'O +O) = —xgQo)f'(). 


Prove that f is bounded. 
Solution. Observe that f’(x)f” (x) + fO (x) = —xg(x)(f’(x))” is negative for 
1 
positive x and positive for negative x. On the other hand, f’f” + ff’ = A F +EP, 


so f? +f has a maximum at 0. Thus |f(x)| < yf2(0) + f” (0) for all x and the 
conclusion follows. L 


Problem. Let f : R — [—1, 1] twice differentiable such that f(0)? + f’(0)* = 4. 
Prove that there exists xo such that f (xo) +f” (xo) = 0. 


Solution. Let g(x) = f(x)? + f'(x). By Lagrange’s theorem there is a € (0, 2) 
2) —f(0 
with f'(a) = An Clearly, |f’(a)| < 1, thus g(a) < 2. Working in the same 


way with (—2, 0), we find b € (—2, 0) with g(b) < 2. Since g(0) = 4, the maximum 
of g on [—2, 2] appears at some c € (—2, 2). Because g(c) > g(0) = 4, f’(c) £ 0. 
But g’(c) = 0, thus f(c) + f” (c) = 0.0 


Problem. How many real solutions does the equation 2* = 1 + x? have? 


Solution. Any such root is nonnegative since 2* > 1. If f(x) = 2* — x? — 1 then 
fO) = 0, f(1) = 0, f(4) < 0 and f(5) > 0, so f has at least three zeros. If f has 
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at least four zeros, by Rolle’s theorem f” would have at least two zeros, which is 
clearly wrong because f” (x) = (In 2)? - 2* — 2 is injective. Hence, the equation has 
three real solutions. L 


Proposed Problems 
1. Solve in [1, 00) the equation % — 8* = x’. 
2* + 2” = 16 
3* + 3” = 54° 
3. Find all positive real numbers a, such that the inequality 


2. Solve in R the system 


a” + 10 > 5+6 


holds for every real number x. 

4. Let f : R — R be a twice differentiable function whose graph meets the line 
y = x three times. Prove that f” (£) = 0, for at least one real value £. 

5. Let a, b, c, m,n, p be real numbers. Prove that 


asinx + b sin 2x + csin 3x + mcos x + n cos 2x + pcos 3x = 0 


for at least one real value x. 

6. Prove that the function f : R > R, given by f(x) = sinx + sin(x?) is not 
periodic. 

7. Let f,g : [a,b] —> R be differentiable, with continuous, positive, increasing 
derivatives. Prove that there exists c such that 


f(b) -f(@)_ gb) -ga 


Z; bog at Oro. 


8. Let f : R — R be a differentiable, bounded function. Prove that f(c)f’(c) = c, 
for some real number c. 
9. Let f : R — R be a convex function so that 


lim (f(a) —x— 1) = 0. 


Prove that f(x) > x + 1, for all real numbers x. 

10. Let f : R — R be a concave, strictly monotone function. Prove that f is 
unbounded from below. 

11. Let f : R — R bea three times differentiable function. Prove that there exists a 
real number c such that 


FOF OF OF" (©) = 0. 
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12. 


13. 


14. 
15. 


16. 


I7; 


18. 


19. 


20. 


21. 


Let f : R > R be a function with fourth derivative such that f(x) > 0, f 
(x) < 0, for all real numbers x. Prove that there exist real numbers a, b, c such 
that f(x) = ax? + bx + c, for all real x. 

Let f : R — R be a function with continuous third derivative such that 
Sf). F œ), f" œ), f” Œœ) are positive and f” (x) < f(x), for all real numbers x. 
Prove that (f”(0))* < 2f(0)f"(0). 


Prove the following equality: 2 arctan x + arcsin ee ans 
T 

Let f be a differentiable function defined on the set of real numbers, with values 

in the same set. If f has infinitely many zeros, prove that its derivative also has 

infinitely many zeros. 

Determine all differentiable functions f : R — R with the following two 


properties: 


(i) f’(x) = 0 for every integer x. 
(ii) If x € Rand f'(x) = 0, then f(x) = 0. 


Let a1, ... , an be real numbers, each greater than 1. If n > 2, prove that there is 
exactly one solution in the interval (0, 1) to 


Te — x“) = 1—x. 


i=l 


Let a, b, c, d be positive real numbers. Prove that 


a ai b if c 4 d a 
b+c ctd dt+ta a+b 
Prove that for all real t, and all œ > 2, 
ett + et E 2 < (e! + ey” = JE, 
Leta > 1 be areal number. Prove that 
Za i 1 
ae 
am +) (n+ 1) ((n +17! a) 
for any positive integer n. 
Prove that the inequality 
a b c 3 


+ + < 
@e@+1 bB+1 G+172 


holds for all positive real numbers a, b, and c for which abc = 1. 
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Solutions 


1. The solution is x = 1. Let us define the function f : [8, 9] > R, by fO) =», 
where x > 0 is arbitrary and fixed. There exists c € (8, 9) such that 


FO -F8 =f'OO-8) F- 8 =x, 


Now, the given equation can be written as xc’! = x”. We have x Æ 0; thus 
œ! = x. For x > 1, we have 


ol > e! >y, 
2. The solution is x = y = 3. Let us define 
fas; ga) =3, xeER. 


If x < 3, then y > 3, and the system is equivalent to 


f(x) +f) = 2f(3) a £0) -fB) =fGB) -—f&) 
g(x) + g(y) = 2863) g(y) — 8(3) = g(3) — g(x) ” 


so 


f(y) —f() = fO) — f(x) 
go) -gB g(3) — g(x)’ 


which is impossible. Indeed, by Cauchy’s theorem, 


f0)-f@) _fle) a OO _ se) 
g(y)— 83) gla) g(3)— g(x) glo) 
f' (x) 
g'(x) 


for some c; € (3, y) and c2 € (x, 3). The function x => is injective, so the 


equality 


fe) _ fer) 
gi(c1) —g’(c2) 


cannot be true. 
3. The answer is a = 3. Let f : R > R, given by 


fa =a4+10-5-6. 


According to the hypothesis, f is nonnegative, which is equivalent to f(x) > 
f(0). Hence f’(0) = 0. We have 


f' (x) = alna + 10" In 10 — 5" In 5 — 6" In6, 
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so f’(0) = Ins =~ = 0 > a = 3. Ifa = 3, the given inequality is 


(2 — 1)(5* — 3*) = 0, 


which is true. 
4. Assume that the function g(x) = f(x) —x, x € R, vanishes at a,b,c,a<b<c. 
According to Rolle’s theorem, 


g(a) = gb) =0 > g7 6) =0 Sf’) =1 
gb) = gc) = 0 > g'(&) =0 > f'(&) = 1, 
with £; € (a,b), & € (b,c). Further, 


FED =f &) = fE =, 


for some & € (&, &). 
5. Let us define the function f : R —> R, given by 


b 
fla) = ~acos.x— 5 cos 2x — 5 cos 3x + msinx + 5 sin 2x + © sin 3x. 
Obviously, 
b c 
0) = fQm) = -a -4 —- Í, 
FO = fx) = a-z- 


so we can find £ € (0,27) so that 
f'E) =0 © asing + bsin 2é + csin3€ + mcos E + ncos 2 + pcos 3& = 0. 


6. If we assume by contradiction that f(x) = sinx + sin(x?) is periodic, then its 
derivative must be also periodic. But then 


f (x) = cos x + 2x cos(x’) 


would be continuous and periodic; hence f’ must be bounded. This is a 
contradiction, because for all positive integers n, we have 


Je (v2nz) =cosV2nn +2V2nt > œ as n> oo. 
7. Let cy,c2 € (a,b), c1 < c2 be such that 


f(b) -fa 2. gb) — g(a) 


ea fe), 8 =e). 
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Consider the function F : [0, 1] —> R given by 


FO = f Og O —f (c1)8'(c2)- 


Then F(c,) < 0, F(c2) > 0, so there exists c € [0,1] such that F(c) = 0. 
That is, 


f)-F@) _ sb) - 8a) 


F ©)’ (©) =f (evg’ (2) = bg b-a 


. Define the function g(x) = f?(x) — x’, x € R. Because of boundedness of f, we 


have 
lim g(x) = lim g(x) = —-o. (14.1) 
xX — 00 xX 0O 


If g'(x) Æ 0, for all reals x, then g is strictly monotone, which contradicts (14.1). 
Thus there is c such that 


g'(c) = 0 $ AOO —2¢ =0  f(Of'(c) = c. 


. Let us suppose that f(a) < a+ 1, for some a. Because 


lim (f(x) —x—- 1) = 0, 
x—> 00O 
we can find b > a such that 


„IOIO 
b-a 


From the convexity of f, we deduce that the function ¢ : [b, co) > R, 


go) -100 
x—a 
is increasing, so 
pw = 9) = OSLO > y 
x—a 


> f(x) 2 mx —a) + f(a) 
=> f(x) -—x-1= (m—1)x+f(a)—-ma—-1. 


Now, if x — oo, we obtain 0 > oo, which is false. 
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10. Suppose, by way of contradiction, that f > 0 and that f is decreasing. If a,b € 


11. 


12. 


R,a < b, let M (xo, 0) be the intersection point of the line joining (a, f(a)) and 
(b, f (b)) with the x-axis. Then, from the concavity of f, it follows that f(x) < 0, 
for all x > xp, a contradiction. 

Let us suppose, by way of contradiction, that 


Of Cf" Cf") <0, YxER. 


In particular, f,f’,f”,f’” do not vanish on R, so they have constant sign. 
Assume that f > 0, so 


FOF OQS” <0, YxER. 


Iff < 0,f” < Oand f” < 0, then f is strictly decreasing, concave, and 
bounded from below, which is a contradiction. If f’ < 0,f” > 0,f’” > 0, then 
g = —f’ is strictly decreasing, concave, and g > 0, which is impossible. The 
case f” > 0 is proved analogously to be impossible. 

Alternatively, one can use Taylor’s theorem (with the remainder in 
Lagrange’s form); thus we have that, for all real x and ¢, there exist cx, such 
that 

T ) 


f@) =f@+ a & -p+ FG a 


Gan” 
therefore, f(x) > 0 and f” (x) < 0 together lead to the contradiction 


a 


0<fO4+>5,6-9 

for all x and t. Indeed, for fixed (but arbitrary) t, this is possible for all x only 
if f(t) = 0. (See also the following problem for such a reasoning.) But if f’ is 
identically zero, the same holds for f”, which is not the case. Similarly, f(x) < 0 
and f” (x) > 0 for all x are incompatible. Thus, if f has the same sign on R, and 
f’ also has constant sign on R, then necessarily f(x)f” (x) > 0 for all x € R. Of 
course, this also holds for f’, that is, our assumption leads to f’(x)f’”” (x) > 0 for 
all x € R and the contradiction follows. 

This is problem A3 from the 59th William Lowell Putnam Mathematical 
Competition, 1998. A generalization (for a 4k + 3 times differentiable function) 
appeared as problem 11472, proposed by Mahdi Makhul in The American 
Mathematical Monthly. 

By Taylor’s theorem, for all real numbers x and f, there exists cy, between x and 
t such that 


fe) = + ew -D+ o a Jé a 


(x—1) + ae as — a 
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Now, using the hypothesis, we have 


oss sso + On + Ou 7 +L %G_» 


SOO 
31 


for all x and t. Considering ¢ fixed, the right-hand side is an at most third- 
degree polynomial in x that must have only nonnegative values. Clearly, this can 
happen only if the coefficient of x° is 0 (i.e., the polynomial’s degree must be 
even). Thus we must have f°) (f) = 0 for any real number f—and this evidently 
means that f is either a second-degree polynomial or a nonnegative constant 
function. 

Of course, a similar statement (with an identical proof) holds for a real 
function that has a 2n-th order derivative (with n a positive integer), namely, 
if f(x) > 0 and f°” (x) < 0 for all x € R, then f is a polynomial function of 
even degree which is at most 2n — 2. 

Let us define the function ¢ : (—oo, 0] > R by the formula 


LO 2 4x") O-O 


$a) = 
for every real number x. We have 


(x) = xf) +f") —f"@) 


and @” (x) = f(0) — f” (x). The function f is increasing, because f’ > 0. Then 
for every x < 0, using the hypothesis again, we obtain f” (x) < f(x) < f(0) so 
go” > 0. Hence ¢’ is increasing and for x < 0, 6’(x) < ¢’(0) = 0. Further, ¢ is 
decreasing, so for all x < 0, 


$œ > (0) = f0) —f'(0). 
Thus 


“x? + xf" (0) + f°) > f'a) > 9, 


eo) = 


for all x € (—oo, 0). Because f (0), f’(0), f” (0) are positive, it follows that for 
all nonnegative x, 


0 
ec) =O 2 ey" +r@>0. 


Hence this inequality holds for all real x, so the discriminant of g(x) is negative: 


F”)? — 2f 0'0) < 0, 


which is what we wanted to prove. 
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14. 


15. 


16. 


17. 


Define the function 


x 
= 2arctanx + in ——, x> 1, 
f(x) arctan x + arcsin EF x 
with 
2 1 2x V 
7 
= + s =0 
TO= Ta (5) 


2x 2 
tek) 


hence f is constant on (1, co) (try showing this as an exercise!). Thus 
f(a) = lim fx) = x. 
x00 


Let (an)n>1 be a sequence of distinct zeros of f. One may say that the problem 
is obvious, because by Rolle’s theorem, f’ vanishes in all intervals (an, an+1). 
However, we must be careful, because there is no reason for these zeros to 
be increasingly ordered. The idea is that any infinite sequence has a monotone 
subsequence. So, by working with a subsequence of (an)n>1, We May assume 
that the sequence is monotone, say increasing. Then Rolle’s theorem says that 
f' has a zero inside each of the intervals (an, an+1); thus it has infinitely many 
Zeros. 

Clearly the identically 0 function satisfies both conditions from the problem 
statement. We prove that this is the only solution of the problem. 

First note that, by (i) and (ii), we need to have f(x) = f'(x) = 0 for all 
x € Z. Suppose there is a € R with f(a) # 0. Of course, a is not an integer, 
thus a € (n,n+1), where n = [a] (the integral part of a). Being continuous, f is 
bounded and attains its extrema on [n,n + 1]; hence we can find b,c € [n,n+ 1] 
with f(b) < f(x) < f(c) for all x € [n,n + 1]. If we have f(a) > 0, this implies 
f(c) > 0; thus c can be neither n nor n + 1 (since f is 0 at these points), that is, 
c € (n,n + 1). But in this case, Fermat’s theorem tells us that f’(c) = 0, while 
property (ii) of f implies f(c) = 0, which contradicts f(c) > 0. 

Similarly, f(a) < 0 leads to f(b) < 0, b € (n,n + 1), f'(b) = 0 and finally 
to the contradiction f(b) = 0. Thus f(a) 4 0 is impossible (for any a), and the 
only function that satisfies both conditions (i) and (ii) remains f : R > R given 
by f(x) = 0 for all x, as claimed. 

This problem was proposed for the 11th grade in the Romanian National 
Mathematics Olympiad in 2015. 

First we solve the problem in the case when all the numbers a; are equal. We 
begin by proving the following: 


Lemma. Leta > l and0 < b < 1 be given real numbers. The equation 
Cea = 


has exactly one solution in the interval (0, 1). 
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An immediate consequence is that the equation 
(—x")" =1-x 


has exactly one solution in the interval (0, 1) for any real number a greater than 
1 and any positive integer n > 2. 


Proof. Consider the function f : [0, 1] —> (0, co) defined by 
f(@) =x + (1—x)’, Yx € [0,1]. 


We have f(0) = f(1) = 1 and 


f(x) = ax! — Yx € [0, 1). 


b 
(1 —x)!-®’ 
Now we see that the graphs of the functions 
g(x) = ax”! and h(x) = b/(1— x)", x € [0, 1) 


either intersect at two points or they do not intersect at all, since g increases on 
this interval from 0 to a (its limit in 1) and it is convex or concave (as a > 2 
and a < 2, respectively), while h increases from b to oo and it is convex (the 
convexity of the functions is easy to check by using their second derivatives). 
But if g and h do not meet, f’ has the same sign (it is negative) on [0, 1), 
as one can easily see, which means that f decreases on [0,1). Since f is 
continuous on the entire interval [0, 1], this would lead to the conclusion that 
f(O) > lim,+; f(x) = f(1), which is false; it remains the only possibility that 
the graphs of g and h intersect at two points 0 < œ < f < 1 in the interval (0, 1) 
and that f’ is negative on (0, œ) and (£, 1) and positive on (a, 8); therefore, f 
decreases from f(0) = 1 to f(@) < 1, then it increases from f(@) to f(B), and, 
finally, it decreases again, from f(f) to f(1) = 1 (hence f(6) must be greater 
than 1). Clearly, f takes the value 1 exactly once, in the interval (œ, p) (apart 
from the endpoints 0 and 1 of the interval), which proves our first claim. 

As for the second claim, this is, as we said above, just a simple consequence 
of the first (proved above) part of the lemma; indeed, the equation (1 — x*)” = 
1 — x is equivalent to x* + (1 — x)!” = 1, and 1/n is positive and less than 1. 

Thus the lemma solves the case when the numbers qa),...,a, from the 
problem statement are all equal; with its help, we can also prove the existence 
of a solution of the equation 


[a —x%)=1-x 
j=l 


in the interval (0, 1). To do this, consider the function F : (0,1) > R, 
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F(x) = | | [0 —2%) | +x- 1, Yx € 0,1). 


j=l 


and suppose that a = min aj, b = max aj. We may assume that a < b, since 
l<j<n l<j<n 


the case a = b (that is when all the numbers are equal) has already been treated. 
Of course, we have a > 1 and b > 1. Because the above product has at least 
two factors (and all the factors are positive), we get the inequalities 


a=" 4x1 < |] [02] 42-1 < G-2’)" 4-2-1, Yx E (0,1). 
j=l 


According to the lemma, each of the equations (1—x*)"” +x = 1 and (1—x?)"+ 
x = | has precisely one solution in the interval (0, 1): there are c and d in this 
interval such that (1 — c*)" + c = 1 and (1 — d’)" + d = 1. From the above 
inequalities (replacing x with c and d, respectively), it follows that F(c) > 0 
and F(d) < 0; now a typical continuity argument allows the conclusion that F 
takes the value 0 between c and d, that is, in the interval (0, 1); in other words, 
the equation from the problem statement has at least one solution in (0, 1). 
In order to show the uniqueness of this solution, we rewrite the equation as 


1 1 


J0- 
j=l 


and observe that both functions u, v : [0, 1) —> (0, co), defined as 


1 


u(x) = = 


0- 


j=1 


and 


v(x) = 


1—x’ 
for all x € [0, 1), are strictly increasing, are strictly convex, and have the line 
x = l as a vertical asymptote. The graphs of both u and v start from the point 
with coordinates (0,1); therefore they must have at most one other common 
point, and this is exactly what we intended to prove. 

We still have to prove the claimed properties of the functions u and v. There 
is no problem with their monotony, and, also, the convexity of v follows easily 
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since its second derivative is 


n 2 
v (x) = io [0, 1). 


The convexity of u is more complicated to establish. 
We may write u(x) = 1/w(x) for 


w(x) = [ [0 — 2%); 
j=l 


hence 


n UWO- w(x)!" (x) 
“= Ww)? 


On the other hand, we have 
log w(x) = a log(1 — x”), 
j=l 


and, by differentiation, we get 


Differentiating one more time yields 


n 


waw -= (W'@))y? > aj[(a; — 1)x0~? + 2°47] 
(w(x))? - (1 xi) 


’ 


j=l 
and the inequality 
w(x)w" (x) — (w < 0 


follows for any x € (0, 1). From this inequality, the positivity of the numerator 
of u” (x) is a direct consequence; hence (since the denominator is also positive) 
u(x) > 0 for all x € (0, 1)—and the convexity of u—is established, completing 
the proof. 

This is problem 11226, proposed by Franck Beaucoup and Tamas Erdélyi in 
The American Mathematical Monthly 6/2006. A simpler solution can be found 
in Chapter 11. 
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1 
18. The function f(x) = —, x > 0, is convex and we can use Jensen’s inequality. 


Assume, without loss of generality, that a + b+ c + d = 1. Then 


a i b z c i d 
b+c c+d d+a a+b 


=af(b + c) + bf(c +d) + f(d +a) + dfa +b) 
> f(a(b +c) + b(c +d) + c(d +a) + dla + b)) 


1 
= > 
ab + 2ac + ad + be + 2bd+ cd ~ 


because 
2(ab + 2ac + ad + bc + 2bd + cd) < (a+b+ec+d)=1 
© (a-c)? + (b- d? > 0, 


which is true. 
19. Claim 1. For all real numbers f > 1 and y > 1, the inequality 


O-)O+)% -yt +120 


holds. For 0 < y < 1, the reverse inequality is true. 


Proof. The function y +> yê is strictly convex on the interval (0, 00) for 6 > 1, 
and if y > 1, the numbers 1/y and (y— 1)/y are both nonnegative and sum to 1. 
Then Jensen’s inequality yields 


1 =i 1 —1 f 
y y y y 


which can be rearranged to get the desired inequality. For 0 < y < 1, we obtain, 
again with Jensen’s inequality, 


yy +-+ DF = y+ -yo+ = 1. 


In both cases, the inequality is strict except for y = 1. 
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Claim 2. For a > 2, the function f : (0, c0) — R, defined by 


oe 1 
fla) = (x+ z) — x =g Yx € (0,00), 


attains its minimum value at x = 1. 
Proof. The function f is differentiable and has the derivative 


a 
xerl 


f= [rie + DT aa H 1], Ves 0. 
For y = x? and 8 = a — 1 > 1 in claim 1, one sees that the expression in the 
square brackets is positive for x > 1 and negative for 0 < x < 1 (and 0 only 
for x = 1), respectively. It follows that f decreases in the interval (0, 1] and 
increases on [1, oo), having thus (as claimed) an absolute minimum at 1. 

This implies, of course, the inequality f(x) > f(1) for all x > 0 or 


1\° 1 
(+i) —x* — — > 2—2, Vx >Q. 
x Deus 


Just put here x = e' (for any real f) and rearrange a bit to get precisely the 
inequality stated in the problem. 

This is problem 11369 proposed by Donald Knuth in The American Math- 
ematical Monthly, 6/2008. The interested reader will find another solution, a 
generalization, and other related inequalities in Grahame Bennett’s paper p- 
Free P Inequalities, in The American Mathematical Monthly, 4/2010. 

We want to prove that 


1 


(n+1) (0 +1)! = nv) ,Wn > 1, 


§(a) — f(a) > 


where 
ia 
h&a = >> = 
j=l! 


and C(a) = limy-+o0 ,(a) is the Riemann zeta function. We can rewrite the 
desired inequality as 


f(a) > (a) + : ,Yn>1 


(n+ 1) ((n ate = i) 
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and we can prove this if we show that the sequence on the right hand side tends 
to €(a) and is increasing. So denote 


1 


,Wn > 1l, 
(n+1)(@4 D =n) 


Xn (a) = 6, (a) + 


and notice first that (x, (a)),>, has the limit ¢ (a), since 


lim x, (a) = (a) + lim y 
nro x0 (4 1) ((« + pa! 1) 


: xe! x =. 
= $(@) + lim 5 rq A 


Next 


1 
aF 
OD (142) (+21 = 4D) 


Xn+1 (a) — Xn (a) = 


1 
Ea ((n +1)! — yt) i 


for any n > 1; thus the inequality x,4) (a) > xn (a) becomes 


1 1 1 
(n+ 27+ n+ HEDGE Dr) OED" 


or 


1 ne! 


G+J aF GE De Donen "= | 


After some small calculation, we find this is equivalent to 
(n+ 1941 yn! (nt 1%! > ni! (n 4-2)", Vn > 1 
and (after multiplication by n + 1) to 
(n+ 1) +n! (n+ 1) >n (n tD +7! (142). 
Finally, we need to prove that 


(n? +2n+ 1)" — (n° + 2n)“ > nt! (n+ Df -— (n+ 1), Vn>= 1. 
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Lagrange’s theorem for the function x +> x“, on the intervals (n? + 2n, n? + 
2n + 1) and (n + 1,n + 2) yields the existence of some 


c € (n? + 2n,n? +2n+ 1), and de (n+1,n+2), 


such that 
(n? + 2n + 1)“ — (n? + 2n)* = ac", 
and 
(n+ 2)* — (n+ 1)" = aa™". 


Since a — 1 > 0, we have 


(n? + 2n + 1)“ — (n? + 2n)" >a (n? + 2n) , 


and 
(n+ 2)* — (n+ 1)* <a(n+2)"". 
Thus our inequality follows like this: 


(n° +2n+ 1)" — (n? + 2n)“ >a (n? + 2n) 


= an"! (n+ 2)! > nt! ((n+2)*-(n+ 1), 


and the proof is now complete. 
Lemma 1. We have 


x y 2a 
+ < 
Sas oe a ee +l 


for any a € [1/72 1] and any positive x, y such that xy = a. 
Proof. By replacing y with a” /x, we get the equivalent form 


x a? x? 2a 
+4 < 
XL+1 ee CR g? +l 


of the inequality that we have to prove. After clearing the denominators and 
some more simple (but prosaic) calculations, this becomes: 


2ax® — (a? + a)x° — (a? + 1)x* + 2(a7 + a) 


— (a? +a?) — (a? + ax + 207 > 0. 


Solutions 249 


As expected, the sixth-degree polynomial on the left-hand side is divisible by 
(x- a)’; thus we can rewrite this as 


(x — a)? (2ax* + Ba? — a)? — (20° — 3a? + 1)x? + Bart — ax + 20°) > 0. 
Now, the hypothesis œ € [1/2, 1] implies a? € [1/2, 1]; thus 


—(2a° — 3a? + 1) = 207 — 1)(1 — a3) > 0 
and, also, 
3a? — a? = 2g? + a7(1—a7) > 0 and 3a*— a” = 204 + a*(1 — a) > 0; 
therefore 
2Qox* + (3a? — a?) — (20° — 3a3 + 1)x? + (Bal — a7)x + 20° 


is positive, as a sum of nonnegative terms (some of them being even strictly 
positive), and the first lemma is proved. 
Lemma 2. The inequality 
ar r 3 
+ < 
rae i 7 a a 2 


is true for any positive t. 
Proof. After clearing the denominators and some further calculations, the 
inequality is equivalent to 
3P — 4È + 3 — 26 3P 630 
S (t= 1P BE + 28 +f + 34 3P + 37 + 6+ 3) > 0, 


which is evident for t > 0. 


Now let us solve our problem. Because we have abc = 1 (and a,b,c > 0), 
one of the three numbers has to be greater than (or equal to) 1. Suppose (without 
loss of generality, due to the symmetry) that this is c and consider two cases. 


(i) First, assume that c € [1 3al; then 
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2 
and ab = 1 = ( 4) . According to the first lemma, we have 


a b 27 2c 


Pal” TS 3 ~ efe+ 
(a + OM 


and this yields 


a j b a c z 2c 4 c 
+1 B41 +17 cecyce+1 +l 


But lemma 2 (for t = „/c) tells us that 


2c c 3 
+ L 
cyc+1 č&+172 


so we get the desired inequality: 


a 4 b “8 co 2c ag c <23 
+1 B+1 +17 ceye+1 G+17 2 


Suppose now that c is greater than 4/4. One can easily observe that the 
function 


Fi Ox) 0.) 7O=— — vis 0 


+1 
has the derivative 
1-2/7 
"(A = ————., Yt > 0; 
Marar 
therefore the function increases Pii to 7 and decreases from 7 to 
oo, having an absolute maximum at wR! 
f(t) <f bs) ead Vom 
— i) ama 3 g * 


In particular, f(a) and f(b) are less than (or equal to) 1V4. Regarding f (c), 
we have 


cz fO SAYD = V4, 
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because of the monotony of the function f. Finally 


a b co 


241 wal ea f(a) + f(b) + fo) 


1 1)\;3 13 ; 13 8 104 3 
<(2 st) p 5 95 “2 
(the inequality */4 < 1.6 is equivalent to 4 < 1.6° = 4.096; thus it is true) 
and the proof ends here. 

This is problem 245 (rephrased) from Gazeta Matematicd, seria A, 
3/2007. In the same magazine, number 3/2008, one can find two more 
solutions (due to Marius Olteanu and Ilie Bulacu) and a generalization from 
Tlie Bulacu. 


Remarks. 1) One can also prove lemma 2 by using the derivative of t => 
2? / (P +1) + 2/(t% + 1). (It is just a matter of taste; the computations are 
of the same difficulty.) Indeed, the derivative is 


—2r (PP — 1) +36 — 1) +30 — 1)) 
(B + 1)2(t6 + 1)? 


and the function has a maximum at 1 (equal to 3/2). 
2) The reader is invited to prove, in the same vein, that 


a P b P c <3 
at+1 b441 +I? 


for all positive a, b, and c with abc = 1. 


Chapter 15 
Riemann and Darboux Sums 


Let f : [a,b] —> R be continuous and positive. By the subgraph of f, we mean the 
region from the xy-plane delimited by the x-axis, the lines x = a,x = b, and the 
curve y = f(x). More precisely, the subgraph is the set 

{(x,y) ER |a<x<b, 0<y<f@}. 


We study the problem of estimation of the area of the subgraph. A method is to 
consider a division (or partition) of the interval [a, b], i.e., 


A = (a = Xo < X1 < +++ < Xn-1 < Xn = b). 
The real number denoted 


| Al] = max (xk — xk-1) 
1<k<n 


is called the norm of A. Next, choose arbitrary points 
& € [krix], 1S k <n, 


which we call a system of intermediate points. Then the area of the subgraph is 
approximated by the following sum denoted 


ont, &) = X SED Gr — x1) 
k=1 


and called a Riemann sum. Our intuition says that this estimation becomes better if 
the norm of the division is smaller. 
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This result remains true for a more general class of functions, namely, Riemann 
integrable functions. 


Definition. A function f : [a,b] —> R is called Riemann integrable (or just 
integrable) if there exists a real number 7 having the following property: for every 
€ > 0, there exists 6(€) > 0 such that for every division A of [a, b] with || A|| < 5(e) 
and for every system of intermediate points (&))<,<,, we have |oa(f, &) — I| < €. 


If it exists, J with this property is unique. (We encourage the reader to prove this 
uniqueness property.) Thus we may use a special notation for I, namely, 


1= [row 


and we call J the Riemann integral, or simply the (definite) integral of f on the 
interval [a, b]|—or from a to b. Continuous functions and, also, monotone functions 
are Riemann integrable—the reader can find the proofs of these properties in any 
calculus book. 

Clearly, the definition tells us that 


[roa tn 8) 


for every sequence of partitions (A,),>1, with lim ||A,,|| = 0, and for any choice 
rs} noo 


of the intermediate points gW in the intervals of A, n > 1. For instance, consider 


b-a 
the partitions ^, determined by the points x”) =a+k—,0<k<n, 
n 


b-a 


b— b— b— 
T E E E EE E =b, 
n n n 


a<a+ 


n 


and the system of intermediate points 


atk 


b-a b-a b-a 
E€ |a+(k-—1) „a+k ,l<k<n. 
n n n 


Since ag = a) = (b — a)/n, this A, is called an equidistant partition (all its 


intervals have the same length), and the corresponding Riemann sum is 


OA, (e) = ae (a+). 
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Thus we have a method to compute the limit of a class of convergent sequences: 


Proposition. For every Riemann integrable function f : [a,b] > R, we have 


In particular, iff : [0,1] —> R is Riemann integrable, then 
Lo (k ; 

lim -ò f (=) = f f(x) dx. 
n>oon E n 0 


One of the most important results comes now. Because it connects the operations 
of differentiation and integration, the theorem that follows is called the fundamental 
theorem of calculus: 


Theorem. a) Let f : [a,b] — R be any continuous function on [a, b], and let 


Fay = [£0 ae 
for all x € [a, b]. Then F is differentiable on [a, b], and 


F(x) = f() 


for all x € [a, b]. 

b) Let f : [a,b] —> R be an integrable function on [a, b]. Moreover, assume that 
f has an antiderivative F : [a,b] — R. That is, F is differentiable on [a, b] and 
F' (x) = f(x) for every x € [a, b]. Then 


b 
i f(x)dx = F(b) — F(a). 


Proof. a) We have, for (fixed, but arbitrary) xo € [a, b], 
Fla) — Flo) = | O dt = e-e 
x0 


for some c, between x and x (the last equality follows from the mean value 
theorem for the definite integral—see the first proposed problem). Thus 

F(x) — F(x) 

oe Efo) 

X— Xo 

has limit f (xo) for x tending to xọ (which makes c, tend to x) as well; remember 
the continuity of f). This means that F’(xo) exists and equals f(x) at any x9 € 
[a, b], which we intended to prove. 
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b) Let A, be a sequence of divisions with the sequence of their norms convergent 
to 0 (e.g., we can take the equidistant divisions of [a, b]). If 


a= x” <x” <- <x =b 


are the points that define A, we can apply to r Lagrange’s mean value theorem, 


thus getting some &” in each interval [x , x] such that 


F (P) — F (2) = (0 22) 7 (82) 


(do not forget that the derivative of F is f). Consequently, 


rO — Fla) = Y (F) -F =E P LAL). 
k=1 k=1 


that is, 


F() — F(a) = oa, (f£), 


for all n > 1. Now we only have to take the limit as n — ox, in order to obtain 
the desired result: 


F(b) — F(a) = lim oa, (r. (E) -f f(x) dx. 


The second part of the fundamental theorem of calculus is usually named the 
Newton-Leibniz formula and serves to evaluate definite integrals with the help of 
derivatives. The notation 


b 
ro 


is often used for F(b) — F(a). Thus 


b 
f fdr = Fo), = F(b) — F(a). 


Also note that if f is a continuous function defined on any interval J, and a is any 
point in Z, then F defined by 


F(x) = f f(t) dt 


for all x € Z is the antiderivative of f on J. 
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Example. Compute lim a,, where a, = —— + +-+ ; 
P P poet on n+1 n+2 n+n 


Solution. We have 


a | 
a ary a 


n 


arD iC) 


1 
where f(x) = ES x € [0, 1]. According to the above theoretical results, 
x 


n 
k=1 


1 1 1 
lim a= f dx = Ia(1 + x)| OE 
noo 0 l+x 0 


Another method to introduce the notion of integrability is due to Darboux. Let 
f : [a,b] —> R be a bounded function. For a division 


a = Xo < X1 < +t < Xp < Xn =b 
define 


m= inf f(x), Me= sup f(x), 1<kz<n. 


xE[rr—1 xa] xE[xk—1 xk] 


The sums denoted by 


saf) = J mar xe), Saf) = D5 Mee — a1) 
k=1 


k=1 


are called the inferior Darboux sum, and the superior Darboux sum of f, respectively. 
We have the following results (the interested reader can find their proofs in any 
calculus book—see, for instance, [12]): 


Theorem. Let f : [a,b] — R be bounded. The following assertions are 
equivalent: 


a) fis Riemann integrable. 
b) forall ¢ > 0, there exists 6(€) > 0 such that for all divisions A with ||A|| < 6(e), 
we have 


Sa(f) —sa(f) < E. 


This also holds in the following slightly different form: 


Theorem. The function f is Riemann integrable on [a, b] if and only if for every 
€ > 0, there exists a partition A of [a, b] for which Sa (f) — sa(f) < €. 
We also mention a powerful result of Lebesgue. 
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Theorem (Lebesgue). A bounded function f : [a,b| — R is integrable if and 
only if it is continuous almost everywhere in [a, b], that is, for any € > 0, there is a 
family of intervals with sum of lengths at most € such that any point of discontinuity 
of f lies in an interval of the family. (We also say that the set of discontinuities of f 
has Lebesgue measure zero or that it is a null set.) 


The proof of this result is highly nontrivial and will not be presented here. We 
show however some consequences of this theorem: first of all, the product of two 
Riemann integrable functions is Riemann integrable. Trying to prove this using the 
definition is not an easy task, but noting that 


OED sas 


fg 5 


reduces the problem to proving that the square of an integrable function is integrable. 
This follows immediately from Lebesgue’s criterion, because the discontinuities of 
f? are among those of f. A more general result (with essentially the same proof) is 
the following: 


Problem. Let f : [a,b] — [a, 8] be a Riemann integrable function, and let 
g : [a, B] — R be continuous. Prove that g o f is Riemann integrable on Ja, b]. 


Solution I. Let € be an arbitrary positive number. Because g is continuous on 
the compact interval |œ, £], it is also uniformly continuous; therefore we can find a 
positive 6’ such that 


|e(x) — go) < —§ whenever x,y € [æ, B] and |x—y| <8’. 
2(b — a) 


Yet, g is bounded, so there exists M > 0 such that |g(x)| < M for all x € [a, £]. 


Now let 6 be a positive number, less than both 6’ and €/(4M). Because f is 
Riemann integrable on [a, b], there exists 7 > 0 such that whenever A is a partition 
of [a, b] with ||A|| < n, we have 


Sa(f) — saf) < 6. 


Let us consider such a partition A = (a = x9 < xı < +- < x, = b) of [a,b], 
with ||A|| < n. We denote by m;(f) and M;(f) the lower and upper bound of f, 
respectively, in the interval [x;—1, x;]; m;(gof) and M;(gof) have similar significance 
for g o f. We have 


Salg of) —sa(gof) = X 0% — xi-1)(Milg of) — milg of) 


i=1 


= Gi -xi (Mile of) — mig of)) +) Ci — 1) (Mile of) — mig 0 f)). 
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In the first sum (i.e., in X- ^), we collect the terms corresponding to indices 1 < i < n 
for which M;(f) — m;(f) < 6 (and, therefore, M;(f) — m;(f) < 6’, too), while the 
second sum is over those i for which M;(f) — m;(f) > 6. Consequently, 


€ 
Mi(g of) —mi(gof) < 2b —a) 


for every i in the first sum (as |x — y| < Mi(f) — mi(f) < 6’ for all x,y € [xi-1, xi]). 
We further have 


8° > ye Gy — xi-1)(Mi(f) — mi(f)) = D'o — x;—1)ô, 
hence 


ae? —xj-1) < ô. 


Putting all these together, we finally get 


Salg of) = salg of) = X Ci — xi) M: of) — mi (g of) 


i=1 


= $ C — x1) (Milg of) — mig of) + 95" Gi — x1) (Milg of) — mi(g of) 
< wi D'a — Xi—1) + 2M "Qi — xi-1) 


€ € € 
<—— (b-a) +2M8 <+ =e, 
$ a. a) + Sa tS € 


and this happens for every partition A of [a,b], with ||A|| < n. Of course, this 
means that g o f is integrable on [a, b], which we intended to prove. O 


Solution II. Lebesgue’s integrability criterion allows an almost one-line proof 
of this result. Indeed, we clearly have g o f bounded (for g is continuous on the 
compact [a, 6]). On the other hand, if Dy and D,op are the sets of discontinuities of 
f and go f, respectively, on [a, b], we obviously have Dgof © Dy (for continuous g). 
(Equivalently we can say that if f is continuous at some point t € [a, b], then g of 
is also continuous at t.) So, if f is Riemann integrable, then Dp has null Lebesgue 
measure, implying that Dof has null Lebesgue measure as well. Thus, g o f is also 
Riemann integrable, finishing the proof. O 


A slightly more involved result is the following: 


Problem. Iff : [a,b] —> R has the property that f+sin(f) is Riemann integrable, 
then so is f. 
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Solution. Using Lebesgue’s theorem, this is not difficult. Indeed, first of all note 
that h = f + sin(f) is bounded, so f is also bounded, because |f(x)| < 1 + |A(x)]. 
Now, note that the function g(x) = x+ sin x is continuous and bijective. Indeed, it is 
increasing and has limits +00 at +00. Suppose that xo is a point of discontinuity of 
f, without being a point of discontinuity of h. Then there exists a sequence (yn)n>1 
that converges to xo and such that (f(,))n>1 does not converge to f (xo). However, 
we know that (g(f(vn)))n>1 converges to g(f(xo)). Let / be any limit point of the 
sequence (f(yn))n>1 (i.e., l is the limit of some subsequence of (f(yn))n>1). Then 
g(l) = g(f(%o)) and so L = f(xo). Thus the bounded sequence (f(¥,))n>1 has the 
property that all its convergent subsequences have the same limit, f(xo). It means 
that it converges to f(xo), a contradiction. Thus the discontinuity points of f are 
among the discontinuity points of h, and thus the conditions of Lebesgue’s criterion 
are satisfied for f, which shows that f is Riemann integrable. 0 


An important consequence of Lebesgue’s theorem is the following fact: 


Problem. Any function f : [a,b] —> R which has finite one-sided limits at any 
point is Riemann integrable. 


Solution. Indeed, let us prove first of all that f is bounded. Assuming the 
contrary, we can find a sequence x, € [a,b] such that |f(x,)| > n for all n. This 
sequence has a convergent subsequence (%;,)n>1, Whose limit is l € [a,b]. The 
inequality |f(xx,)| > kn shows that one of the one-sided limits of f at / is infinite, 
which is a contradiction. Thus f is bounded on Ja, b]. 

Next, let us prove that f is continuous almost everywhere. It is enough to prove 
that the set of discontinuities of f is at most countable. For any x, let 


fot) = limf) 
and 
fQ-) = fay (7). 


Clearly, it is enough to prove that the set of points x where f(x+) > f(x—) is at most 
countable. But this set is the union of Ap q, where A, is the set of real numbers 
x € [a,b] such that f(x—) < p < q < f(x+) and the union is taken over all pairs 
(p, q) of rational numbers such that p < q. Clearly, any point of A, 4 is isolated. So, 
because the union is taken over a countable set of pairs (p, q), it is enough to prove 
that each A, , is countable. This will follow if we manage to prove that a set X of real 
numbers all of whose points are isolated is at most countable. Let [,,h,... be all 
open intervals whose extremities are rational numbers. For all x € X, we know that 
there exists n(x) such that x is the only common point of X and J,,,.). The function 
n defined on X is clearly injective, and because its values are positive integers, X 
is at most countable. Thus f is continuous almost everywhere (we leave as an easy 
exercise for the reader to prove that a countable set can be covered with a family of 
intervals whose sum of lengths is at most £, for any € > 0), and so f is Riemann 
integrable. O 


15 Riemann and Darboux Sums 261 


The following result is weaker than Lebesgue’s theorem, but still nontrivial: 


Problem. A Riemann integrable function f : [0,1] —> R is continuous on a 
dense set of [0, 1]. 


Solution. Let 0 < a < b < 1. We will prove that f is continuous at a point 
in [a, b]. Using Darboux’s criterion, we can construct by induction a sequence of 
nested intervals 7, = Jan, bn] such that J,4; is a subset of (an, bn), Jo = [a,b], 
bn — an converges to 0, and 


sup f(x) — inf f(x) < 2 
xEln n 


xEln 


for all n. Indeed, suppose we constructed J,. There exists 6 > 0 such that for any 
division A with ||A|] < ô we have 


— an 
S < ——— 
af) — sa(f) ot +)’ 
Let r = min( aa thaw ma §). For any division A = (xo, x1, . - . , xk) such that || Al] < 


r, let a’,,, be the sinallest x; among those which belong to (dy, bn) and tet b’,,, be the 


greatest such x;. Clearly, [a),, ;, b/,, |] is a subset of (an, bn) and b),,;—@),4, = bazan : 
Also, let 


c= / . eee . = / 
Xio = Ang < Xin +1 < < Xio+p = bapi 


be the points of A that belong to [d’,, ,, D/,,,]. Finally, let 


S;= sup f(x)— inf f(x). 


x€[x;.xj41] x€lyi x41] 
Then 


1 
by — < 
GD >y Soti Xiti+ — Xi): 


j=0 


If Si+ is the smallest among the Si +;, we deduce that 


by — an a 1 
241 anD T n 


Sio < 


Therefore we can choose an+1 = Xip+ı and bn+1 = Xip+i41. Now, using the lemma 
of nested intervals, there exists xọ belonging to the intersection of all 7,. Clearly, f 
is continuous at xp. O 
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Finally, a beautiful application of Riemann sums and subtle estimations appears 
in the solution to the following problem, taken from a Romanian Olympiad. 


Problem. Let f : [0,0o) — R be a 1-periodic and Riemann integrable function 
on [0, 1]. For a strictly increasing, unbounded sequence (xn)„>o , With xo = 0 and 
lim (x41 — Xn) = 0, denote 
n—>Co 


r(n) = max {kE N| x, <n}. 


a) Prove that 


r(n) 1 
lim p Len — xf) = f F(x) dx. 


noo 


b) Prove that 


— 1 fink) f' 
gree : = | feds 


=1 


Solution. a) Let us denote 


Sp = > r+ — xO) f%), pel. 


pP—1<xk <p 
Then, if 


r(n) 
an = = 9 Geri =f), 


k=1 


ie , ; 
we have a, = — > Sp and according to the Cesàro-Stolz theorem, lim a, = 
n i noo 
p= 
lim s,, if the limit on the right-hand side exists. However, note that 
noo 


s= > @H-wafe= >) OyO 


n—1<xg<n O0<xp—(n—1) <1 


with yk = x, — (n — 1), represents the Riemann sum related to the function f and 
the division (Vx),(n—1)<k<,(n) Which tends to zero in norm, as n — oo. Also note 
that f (x) = f (yk) because f is 1-periodic. Thus, 


15 Riemann and Darboux Sums 


1 
lim Sn = / I (x) dx. 
n—>Co 0 


b) For x, = Inn, we have 


le"] 


1 k+1 


k 


=1 


We also have 
1 
lim Stan} = I f(x) dx, 
noo 0 


so 


elinn] 
p kemi] 


Now, 


i elln | 


Let us prove that 
1 : k+1 
lim — y In ——f (Ink) = 0. 
im n x -f (in k) 


n>oo Inn 
k=[ellnl]+1 


First, with M = sup,ejo,1 |. 


n n 


k+1 k+1 
X h fno] <m: X h > = Min 


k=[elin™l] +1 k=[ellml]+1 


which tends to zero, as n — ov. Finally, 


Sn = 2a -—— f(k) > Í EE dx. 


1 
Inn] D m= fano > f fO a as n— oœ. 


n n 
k+1 1 k+1 1 
l Ink) = l Ink) + — 
Kea" poe inn ne thn D 


= k= [elna] +1 


1 &/1 k+1 
Sohih k ) ran‘) 


k=1 


jes fa 
<M. 
x Inn = k 


14+54+---+4-In+ 1) 
: > 


Inn 


=M 
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k+1 
In t stink), 
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Proposed Problems 


1. a) Prove the monotonicity property of the Riemann integral, namely, that if f, g : 
[a,b] — R are two Riemann integrable functions such that f < g on [a,b], 


then 
b b 
f toas [sea 


b) Prove the mean value property for the Riemann integral. Namely, show that if 
f : [a, b] is a continuous function, then there exists c € [a, b] such that 


b 
| fe = 6-aro, 


2. Find the limit of th S sin 
. Find the limit of the sequence a, = ee 
E ivn +k 
n 1 


3. Find the limit of the sequence a, = 5 5 n>. 
ikt [vn +kn+k 


4. Does there exist a Riemann integrable function f : [0,1] —> R such that for 
every p,q € (0,1),p < q there exist c,d € (p,q) for which f(c) = c? and 
f(d) = d? 

5. Letf : [0, 1] — R be a Riemann integrable function such that 


o(s 


n— m 


1 
for all positive integers m < n, gcd(m, n) = 1. Compute / f(x)dx. 
0 
6. Let f : [0, 1] — R be integrable such that 


PO FIO < Fp), 


for every x, y in [0, 1]. Prove that if f is continuous at e~!, then 
1 
OLE 
0 


7. Letf : [a,b] — R be with the property that for all € > 0, the set 
tx € [a,b] | FO) > s 


1 
is finite or empty. Prove that f is integrable and | f(x) dx = 0. 
0 
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8. Let f : [0,1] — R be an integrable function such that for every 
p.q € (0,1),p < q, there exists € € (p,q) for which f(€) = 0. Prove that 
i 
f(x) dx = 0. 
0 
9. Let f : [0,1] — R be an integrable function such that for every 
p.q € (0,1),p < q, there exist c,d € (p,q) for which f(c) + f(d) = 2. 
Prove that 


1 
| roa= 1. 
0 


10. Let f, g : [0,1] — R be increasing, with g(0) > 0 and let (an)n>1 C (0,00) be 
such that (nan)n>1 converges to 1 and is monotonically increasing. Compute 


ii f (Gn) + f(2an) + +++ + fnan) 
n>00 g(an) + g(2dn) + +++ + gnan) 


11. Prove that the function f : [0,1] —> R, given by 


1 


fair 9 
0, xA#A-,neEN* 
n 


1 
x=-,neN* 
n 


1 
is integrable and | f(x)dx = 0. 
0 
12. Let f : [0, 1] —> R be increasing. Prove that 


: eee gc 
[row t0(2) 


for all positive integers n. 
13. Let a; = 0.5, and an+ı = ~ 1 + na, for n > 1. Compute 


_fW =F) 


n 


2 n 
li EEE E 
jim (tat 1) 
14. Let f, g : [a,b] —> R be such that 
Ifa) —fO)| < l) - s0)| 


for all x, y in [a, b]. Prove that f is integrable if g is integrable. 
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15. Let f : [0, 1] —> R be integrable. Prove that 


wf tO) HOt) er COR 


noo n 


= 0. 


16. Let f : [0, 1] > R be integrable and let (a,,),,., be a sequence such that 


<1, 


Da 
k=1 
for all positive integers n. Prove that 
Lok 
lim — = =0. 
POXOL 


17. Letf : [0, 1] —> R be a continuously differentiable function. Prove that 


l l eek f) -fO 
tin.a( [roa 157 (5)) = F 


n—-1 


18. Compute the limit lim n| Z oe 
K n == =e $ 
E mo (4 Om +k 


19. Are there polynomials P, Q with real coefficients satisfying the equalities 


POD r eE arog 
x = = m ies =. 
o Q(x) 2 3 n 


for each integer n > 2? 
20. Let f : (0, 1] — R be a continuous function such that 


1 1 
lim J fÒdt = f f(adx 


exists and is finite. Does it follow that the sequence (S,,),>1, where 
tee nek 
Sn = —], 
AG 


1 
converges to / f(x)dx? What if f is decreasing and lim xf (x) = 0? 
0 P ar oa 
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21. Let f and g be two functions defined on a compact interval [a, b], such that f is 
Riemann integrable on [a, b], g has an antiderivative G on [a, b], and f(x) < g(x) 
for all x € [a, b]. 
b 
a) Prove that f f(x) dx < G(b) — G(a). 
ae 
b) Prove that if | f(x)dx = G(b) — G(a), then g is Riemann integrable on 
la, b]. A 
22. Let g : [a, 6] — R be a Riemann integrable function, and let f : [a, b] > [«, 8] 
be a continuous function having the property that for every a’ € (a, b], there 
exists an L > 0 (depending on a’) such that |f(x) — f(y)| > L|x — y| for all 
x,y € [a’, b]. Prove that g o f is Riemann integrable on [a, b]. 
m/2 u/2 
23. Let A, = | cos?” xdx and B, = f xX cos” x dx. 
0 0 
1 
a) Prove that A„,—1 — An = moi" and that A, = n(2n—1)B,—; — 2n7B,, for 
n— 
alln > 1. 
b) Conclude that 
1 Br-1 Bn 
BIER) ae 
n? An-1 Ay 
for all n > 1, and then deduce that 
Co 
1 r? 
2)= ie 
62) 2 Aa 
Solutions 
1. a) If f(x) < g(x) for all x € [a,b], it follows easily that oa, (r. go) < 


OA, (s. gf) ) for every n > 1, where A, is a sequence of partitions of [a, b] 


with lim ||A,|| = 0, and Ew are some corresponding intermediate points. 
n—>>Co 


Thus the result follows by passing to the limit for n — ov: 


/ TOi Jim os, (f.&") < lim oa, (8.4) = f ‘Oe 


Note that, in particular, the integral of a nonnegative function is also 
nonnegative. Also, one can show that if continuity on [a, b] is assumed for f 
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and g, the equality in the inequality that we proved is achieved if and only if 


f= gon [a,b]. 
b) Let m = in f(x), and M = sup f(x) be the extrema of f on [a, b]. The 
xEļa, x€[a,b] 
extreme value theorem tells us that m and M are actually values assumed by 
f. The monotonicity of the Riemann integral applied to the functions f — m 


and 0 to f — M and 0, respectively, yields 


b 
m(b—a) < / F(x) dx < M(b—- a). 


Thus 


1 b 
=) f(x) dx 


is between m and M—two values of f; now the intermediate value property 
of f ensures the existence of c € [a, b] such that 


1 b 
— f Tode sO. 


We used this result in the proof of the fundamental theorem of calculus. So, 
now we know that if f is continuous on [a, b], then the function 


xe F(x) = f to dt 


is an antiderivative of f. Thus the equality from the mean value theorem reads 
F(b) — F(a) = (b — a)F' (c), that is, it can be inferred from Lagrange’s mean 
value theorem applied to F. This shows that c can be actually chosen in the 
open interval (a, b). 


. For every integer 1 < k < n, we have the estimations 


Vn? +1 <yn +k< yn +n, 
So 


1 ” kx 


1 T. kr 
——— 9} sin — < an < ———— sin —. 
vn +n n n+l n 


Now let us consider the function f : [0, 1] > R, f(x) = sin zx. The function f 
is continuous, so it is integrable. It follows that 


Solutions 


Jin or (G Ja [roa 


1 


n 
ae | _ kr 
lim — J sin — 


0 


í 1 
i, sin mx dx = —— -cosmx 
0 TT 


Because 


n LO. kr 
. sin ine 
vne +n 22 no Seer 3 n 


2 
it follows that lim a, = —. 
noo It 


3. We have 
t= 1 
an = ’ 
Mar F+ 14+ 4445 
so 
Die 1 ee 1 
DDr k k = <an<7 De / E 
= aoe aoe Et oa k=l 5+ 1+ 5 


The numbers 


n n n n n n 


k k |: =" k E =] 
+ 2 E ’ ’ = ’ 
n 
can be viewed as systems of intermediate points associated to the division 


1 2 n—1 
A=[0<-<-<.< <1 
n n 


n 


1 
and the function f : [0, 1] —> R, f(x) = t————_.. Thus 


ji i: dx 
im a, = oo 
m5 ee: 


4. The answer is no. Supposing the contrary, consider the division 


1 2 n 
Anr=[0<-<-< <- = ,n>1 
n n n 
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of the interval [0, 1] with the norm ||A,,|| = 1 — 0, as n — oo. According to 
the hypothesis, we can find 


k—1 k 
Ek ne €|—,-|, 1<k<n 
n n 
such that 
f@=& FO) = m 
Hence 


iene eo | 
im ~ Y>F(&i) = lim EY f = f fee) 
n>œ n = n>œ n = 0 


And yet, these limits are different, so the problem is solved. Indeed, consider 
the sum 


1S es k—1 k 
POR be =.=) 


as a Riemann sum associated to the continuous function ¢ : [0, 1] > R, é(@%) = 
x. Therefore 


Te l | 1 
lim — a= f dea [Pde = 5. 
ee , 20 1 3 


Similarly, if we consider the continuous function wy : [0,1] > R, w(x) = xX, 
then 


jes f 1 1 
lim — A da= f ax= 5. 
2a Swede =f ; 


. First note that the sequence 


is convergent and 


1 
lim a, = f f (x) dx. 
n> 0 
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If p is prime, then 


In conclusion, the convergent sequence (a,),+,; has a subsequence which 


1 
converges to zero. Hence a, — 0 and f F(x) dx = 0. 


0 
6. We can prove by induction (using the same trick as the one Cauchy used to 
prove the AM-GM inequality) that 


fæ) + f (x2) +e EFi) < f (3/x1X2 7 Xn) 


n 
for all positive integers n and all x1, x2, . . . , Xn. Thus 
1 k ijl 2 n Yn! 
ní \n nn n n 
Now, using the well known limit lim nl = e!, we obtain 
noo 


[1 dx = tim an < Jim f (=) =f (e). 
7. For € > 0, denote by 
Ae = {x € [a,b] | FŒ > £ 
and assume that card A, = ns. Obviously, f is bounded so let 


M= sup fœ]. 


xE[a,b] 
Let 


A = (a = xo <x, < -+ < Xn = b) 
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be a division of the interval [a, b] and let & € [x,~1, xk], 1 < k < n be a system 
of intermediate points. We have 


< SOU E)| Ge — x) 


k=1 


POSED Or- a1) 


k=1 


= $ IE) ea) + DS FED) ak- xe) 


ék EAe EkÉAe 


<HA YO FED + e- XO Or- x1) < Mne- ||Al] + elb — a). 


EkEAe EkÉAe 


E 
Now, if || A|| < —, then 
Mn, 


0< DOSED r — x-1) <eļlb-a+ 1) 


k=1 


1 
and consequently, f f(x) dx = 0. 


0 
. For each integer n > 2, let us consider the equidistant division 


1 2 n—-1 
An, ={0<-<-<:::< <1 
n n n 


of the interval [0, 1] . Define the system of intermediate points 


i k-1k A j 
EP e |=. 4 » with fP) = 0, 
non 
for all integers 1 < k < n. The norm ||A,|| = 1 tends to zero, as n — OO, so 


1 
the corresponding Riemann sum tends to f f(x) dxasn > œ: 
0 


Ie l 
lim — @) =f dx. 
lim, - dS y= | fe) dx 
But the sums are identically zero, because all their terms are zero. Thus, 


[roaa 


Solutions 


9. 


10. 
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Consider the division 


1 2 —1 
by =(0<2 <2 <4 <1) 
n n 


n 


of the interval [0, 1]. According to the hypothesis, we can find 


so that f(&?) + f(n}) = 2. From the fact that f is integrable, it follows that 


1 PE l 
im ~ FE) = im — ro) = f fO dr 
k=l Mia 0 


n>œ Nn 


Therefore 


l diese 1$ 
2 f FG) dr = lim T DFG) + im, | DVM 


1 n 
jim -JO FED +D) = 2 
k=1 


1 
so f f(x) dx = 1. 


Let us consider the divisions 
An = (0 < an < 2an < +++ << nan <1), n> 1 
of the interval [0, 1]. Easily, 
||A,|| = max {an, 1 — nan} > 0, as n—> oœ. 


The functions f, g are increasing, so they are integrable on [0, 1]. If we choose 
the points 


& = kan € [(k — 1)an, kan], 1< k <n, 


then 


n n 1 
lim X` f (kan) (kay — (k — Van) = lim an X f (kan) = I f(x) dx 
noo a noo PE 0 
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and 
n n 1 
im 3 g(kan) (kay — (k — 1)an) = im An 2, slka) = Í g(x) dx. 


Finally, 


J Ska) 22 WOE 


lim ——— = lim 


ue noo n 1 j 
J skan) an Y g(kan) Í g(x) dx 
k=l k=1 


11. The function f is bounded and continuous almost everywhere (continuous on 
[0, 1] \ {1/7 | n € N*}). In order to compute the integral, we will choose the 
particular division 


1 2 n—1 
A=[0<-<-<.< <1 
n n 


n 


and the system of intermediate points 


ie | FARO, Iskan 


,— 
n 


We have f(&,) = 0 and 


1 n 
| fovav= im Ero. 
k=1 


More generally, any integrable function that vanishes on a dense subset of [a, b] 
has integral equal to 0 (as we have seen in a previous exercise). 
12. We have 


1 n 
f f(x) ->>9() 
9 Mier \" 


n k 1 n k 
2 ftos SA) 
n k k 
ro-se 
Dji Go 
EE) 
k=1° n 
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ERO AZ) 


_ fQ)= FO) 


n 


13. First we can prove by induction that n — 2 < a, < n. Under this assumption, 
we have 


anı = V1 + na, > Jl +tnn—2)=n-1 
and 


Qn41 = Vl tna, < V14+n? <n+1, 


; me a 
son— 1 <d,4; < n+ 1. These inequalities show that — converges to 1. Let 
n 


us consider the function f : [0,1] —> R, given by the formula f(x) = = 
x 
x € [0, 1] and the division 


a 2a na 
A=(0<—2 «<<... < <1). 
n2 n2 n? 


For the system of intermediate points 


Ej = 


2 2 i 


kan _[(k—1)an kan 
wef Ja Fiske 
n n n 


we have 


| . Z kan kan (k—l1)an 
[sor mis (") (2-6) 


n n 
iav Ay ka, O a k 
= lim 45 =—— = lim 49 = _. 
n> n? n +kan non cn + kan 


In conclusion, 


n 


. k ! 


k=1 


14. Let ¢ > 0. For every division 


A = (a = xo <x, < + < Xn = b) 
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with norm ||A]| < 6(€), we have Sa (g) — sa(g) < £. Let us denote 


m= inf g(x), M= sup g(x) 


x [xe—1 xx] x€ [x1 xk] 
and 


m. = inf f(x), MÍ = sup f(x). 


x€ [x1 xk] x€ [xx—1 xk] 
From the given inequality, we deduce 
MÉ — nl, < MÈ — mf, 1<k<n. 


Therefore 
Saf) — saf) = DUA = m) (Ek — X1) < XO ME = mi) Ek — x1) 
k=l k=l 


= Sa(g) — sa(g) < €. 


In conclusion, f is integrable. Alternatively, observe that the hypothesis and the 
fact that g is bounded easily imply that f is bounded: all we need is to note 
that FÆ] < If(@| + lg@| + |g(@]. Also, for an xo € (a, b), the inequality 
If (x) —f(xo0)| < |g(x) — g(xo)| implies that f is continuous at xo if g is. Thus, 
the discontinuity points of f are included in the set of discontinuity points of g 
and we can apply Lebesgue’s criterion in order to complete the solution. 

15. The function f is integrable, so it is bounded. Assume that |f(x)| < M for all 
x € [0, 1] and for some M > 0. For every integer 1 < k < n, we have 


OEO 
OE O 


By adding these inequalities, we obtain 


HOGHE] 


< 2M 


and thus 
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Next, 


a a 


< Saf) — saf) > 9, 


where Sa (f) and sa (f) are the Darboux sums corresponding to the equidistant 
division 


Thus 


and the problem is solved. 


16. Let us define the sequence (b,),>9 by the formula b, = 5 ax, with bọ = 0. 
k=1 
By using the Abel-Dirichlet summation method, we have 


SOLE e 
LOOO- 
pE rojo 
OOO) 
+t P(E) -0| + ro 
FOO) 


HE) olro], 


1 
n 


1 


n 
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which tends to zero, as n — oo, because, as we have seen in the previous 
problem, 


fen LD-FOIAVEA)-F@l+ +O o 


noo n 


i =( roa- 15r (5) 
t D) 
ANESSE 


1 n—-1 f k 
$5, Dut (=). 
k=0 


Pick € > 0. Because f” is continuous on [0, 1], it is uniformly continuous, and so 
there exists mo such that for all n > no and all x, y such that |x— y| < L, we have 
If’ (x) —f’()| < e. From now on, we consider n > no. For a fixed x € [0, 1], 


Lagrange’s theorem asserts the existence of a point Ck,n,x € [5 „x+ k1] such 
that 


f (=+ 1) Sf (>) le 


Because eve — 1) < 1 we deduce that 


n aman Oe 


he) ) 


for all k, x. By integrating this between 0 and L, we deduce that 


Lene E) 


17. We have 


KIES 


E 


s 2n?’ 
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which shows (by summation) that the term 


>> a are el or ed rw) 


-1 
Lo, (k 
converges to 0. Because the term — a f | — | converges to 
2n rast n 


‘pode _ f0) -F) 
=. 


the conclusion follows. 
18. Let us consider the function f : [0,1] —> R, given by f(x) = 
According to the previous problem, 


1 n—-1 
1 k 1)—f(0 
lim n i f(x) dx— 2 ) _ fM- a 
noo 0 n ar n 2 
which can be written as 
T so n 1 
tin.» (3 Vata] Sry 
19. The answer is no. We have 


lnn P 1 1 1 
f (Fe a1) acm 54 5 tet -mn 
à n 


TEA x € [0,1]. 


O(n) 2'3 
or 

nn Rix 

| oan 
where R(x) = P(x) — Q(x) and 

23 n 


It can be easily proved that 


1 
lim n? (cny = cn) = ==, 
n>oo 2 
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using 


Further, 


——dx = Cn+1 — Cn 


f" 1) R(x) 
Inn Q(x) 


and, according to Leibniz-Newton’s theorem, there exists &, € (Inn, In(n + 1)) 
so that 


An(n + 1) —Inn)- aE = C41 — Ci 
Equivalently, 
Bi (+8) ons 
or 
AR(En) _ 1(Cn41 — Cn) 
Qn) m(t"? 
so 


nR(En) 1 


lim ; 
noo O(En) 2 
With k = deg Q — deg R, we have k > 1, because 


REn) _ 


oe Q(En) 


Furthermore, the limit 


tim aR Gn) 


n> O(E,) 


is finite and nonzero. It follows that 


2 ~ woo OE) n> OE) woe ER 
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hence the limit lim E is finite and nonzero. But this is a contradiction, because 
noo Sn 


n n 
E = — 
Ek ~ In*(n+ 1) 


20. The answer to the first question is negative. Indeed, for positive integers n, the 
intervals 


E 1 1, 1 
"O (27 2.40 2n 2.4n 


are pairwise disjoint; thus one can define a continuous function which is 
piecewise linear on these intervals, zero between them, and such that 


1 -y 
s(5)= 


and f vanishes at the endpoints of [,,. Then f is positive, 


TETS 
a = al \ on J = 7 


and one can easily check that 


which converges to i As expected, the second question has a positive answer. 
The fact that f is decreasing implies the inequality 


£ real 
f sonst (=) > f OA 


This implies the inequality 
1/1 A 1 l 
-f= ] + | Sd Saz —f()+ | f@)dx, 
n\n 1 n 1 


which combined with the hypothesis lim 1 f (+) = 0 gives the desired result. 
n—>Co 
21. (Sorin Radulescu, District Mathematical Olympiad, 1984) 


a) Let 


282 15 Riemann and Darboux Sums 


be a sequence of divisions of the interval [a, b], with lim, ||An|| = 0. 
For every positive integer n and every 1 < i < k,, choose in the interval 
[x7_,, x7] the point £” (whose existence is ensured by Lagrange’s mean value 
theorem) having the property that G(x?) — G) = Q7 — x7, )g(&?). We 
then have, for each n > 1, 


kn kn 
0, (f,8") = > oF — x FE) < DG? — x1) (&) = 


i=1 i=1 
= X (G) — GŒ) = Gb) — Ga). 
i=1 


Therefore, by passing to the limit and using the definition of the Riemann 
integral, 


b 
[ foods = iim ont," < GO) - Ga) 


as we intended to prove. 
b) Clearly, one can prove, in the exact same way as above, that 


i, f(x) dx < Gib’) — Gd’) 
for alla < a' < b' < b. Thus, fora < œ < $ < b, 


f “Fj EOE 
B 
f F(x) dx < G(B) — Gla), 


b 
[ soar = 60) - 608) 
But we also have, by hypothesis, 
a B b b 
J fœ)dx+ [ f(x)dx + J fœ) dx = / f(x) dx = G(b) — G(a) 


= G(a) — G(a) + G(B) — G(a) + G(b) — G(B): 
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22. 


therefore all the above inequalities must be, in fact, equalities. This shows that, 
actually, ee f(x)dx = G(b') — G(a') for alla < d' < b' <b. 

Now, for such a’ and b’, and any (arbitrarily chosen, but fixed for the 
moment) xo € [a’, b'] and any x € [a’, b'], we have that 


G(x) — Go) _ Saf at 


XxX — Xo X — Xo 


is between the bounds of f in the interval [a’, b']. Thus, the limit of this ratio for 
x — Xo (which is g(xo); if xo is one end of the interval [a’, b’], we only take a 
one-sided limit) is also between these bounds; therefore 


ef f(x) < g(xo) < sup f(x) 
a’,b’| x€[a’,b’] 


for every xo € [a’, b']. We deduce that 


inf fœ) < Eo g(x) < sup g(x) < sup f(a) 


x€[a’,b’] a',b'] x€[a’,b’] x€[a’,b’] 


for any subinterval [a’, b’] of [a, b]. (Of course, the first of these inequalities 
also follows from the hypothesis f(x) < g(x) for all x € [a, b], but that is not the 
case with the last inequality.) Now it’s an immediate consequence of the above 
inequalities that 


sa(f) < sa(g) < Sa(g) < Salf) = Sa(g) — sa l8) < Saf) — Saf), 


for any partition A of [a, b], and the Darboux criterion for Riemann integrability 
shows that the Riemann integrability of f implies the Riemann integrability of 
g, finishing the proof. 

Solution I. First we see that f is injective on (a, b]. Indeed, for some x,y € 
(a, b] choose an a’ > a such that x,y € [a’, b], and let L be the corresponding 
constant for this interval. If we assume f(x) = f(y), the inequality |f (x) — 
f()| = Llx — y| shows that we necessarily have x = y. Being continuous and 
injective, f is strictly monotone on (a, b], and, by continuity, it is actually strictly 
monotone on [a, b]. We can assume, without loss of generality, that f is strictly 
increasing and that [a, 6] is actually the image of [a, b] under the continuously 
increasing map f (so that f(a) = «œ and f(b) = £). 

Consider an arbitrary € > 0, and let M > 0 be such that |g(x)| < M for all 
x,y € [a, $]. Let a’ = minfa + €/(4M), b} and let œ’ = f(a’). Consider L > 0 
with property that |f(x) —f(y)| > L|x — y| for all x, y € [a,b]. 

Because g is integrable on [a, £], it is also integrable on [a’, 6], hence we 
can find a division A’ = (a@’ = y} < +- < y, = $f) of the interval [a’, 6] 
such that Sa (g) — sa’ (g) < Le/2. Of course, for each y;, there exists a unique 
x; € [a, B] such that f(x;) = y; and, in fact, a’ = xy < x2 < -+> <x, = b. 
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Now we consider the partition A = (a = x9 < xı < +-+- < Xn = b) of [a, b] 
and compute 


Sa(g of) —sa(g of) 


= (a! — a)(Mi(g of) — mi (gof)) + Yi — 1) (Milg of) — mg of), 


i=2 


where M;(g o f) and m;(g o f) are the respective upper and lower bounds of 
g o f in the interval [x;—1, x;]. One sees immediately that M;(g o f) = M,(f) and 
m;(g of) = m;(f), where M,(f) and m;(f) are the corresponding bounds of f in 
the interval [y;—1, yi], for i > 2. For such i we have x; — x1 < (/DF œ) - 
f(x%-1) = (1/L)(0; — yi-1); therefore the above expression of the difference 
between the upper and lower Darboux sums of g o f on the interval [a, b] can be 
evaluated as follows: 


1 n 
Sa(g of) = salg of) < (d — a)2M + 7 Y 0i — yi1) M:8) — mig) 
i=2 


1 
= (d — a)2M + 78a’ (g) — sa’(g)) 


€ 1 Le 
< — -2M+—-— =e. 
4M L 2 
Thus, for every € > 0, there exists a partition A of [a, b] such that S4 (g o f) — 
sa (gof) < e€, which means that g o f is Riemann integrable on [a, b], finishing 
our (first) proof. 


Solution II. We use the Lebesgue criterion for Riemann integrability. As in 
the previous solution, we may assume that f is one to one from [a, b] onto [a, £] 
and strictly increasing. Of course, g o f is bounded, so all we need to prove is 
that g o f is continuous almost everywhere. We observe that Dgof C f~! (Dg), 
where by D, we denote the set of discontinuities of the function A. (This is just 
another way to put the fact that if g is continuous at some point of the form 
Ff (x0), xo € [a, b], then g o f is continuous at xo.) Thus it suffices to show that 
f—!(D,) is a null set. 


Let € > 0 be given, and let c = min{a + €/2,b}. Consider y = f(c) + 
c = f~! (y), and L > 0 such that |f(x) — f(y)| > L|x — y| for all x, y € [e, b]. It 
follows that |f—'(x) —f~!(y)| < (1/L)|x—| for all x, y € [y, B]. The set D, can 
be covered by the union of intervals [a;, bi] whose sum of lengths is less than 
Le/2, and we can assume that each and every such interval is included in [a, £] 
(because D, is included in this interval; therefore it can also be covered by the 
union of [a;, bi] N [w, 8] —whose sum of lengths is at most equal to the sum of 
lengths of [a;, b;]). We have 
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23. 


D; © Ula: b: >f ' (D) € ORORO) 


= f (D;) © [a,c] U (Uirae Ale, m) . 


Each intersection [f~!(a;),f—!(bj)] N [c,b] is either empty, or reduced to 
one single point, or [c,f'(b)] = [f~'(y).f7'(b)], or the whole interval 
[f ! (ai). f7! (b;)]. In the first two cases, the length of such an interval is zero, in 
the third case, its length is 


Fe-S) < ei- 1) < o-a), 


and in the fourth case, its length is again at most (1/L)(b; — ai). Thus we 
managed to cover f -1 (D;) with a set of intervals whose sum of lengths does 
not exceed 


1 Le 
— =eę, 
2 


1 € 
= F Ba) SoS 
c ae aoe a) S3 t7 


which is what we intended to do. 


Remarks. 1) The result remains true if f satisfies the weaker condition: there 
exists L > 0 such that |f (x) —f(y)| > L|x — y| for all x,y € [a,b]. Any of 
the above proofs can be adapted to prove it in this form, with an obviously 
simpler wording of the demonstration. 

2) We have already seen that the composition g o f of a continuous g with 

an integrable f is integrable. This result shows that we can ensure the 

integrability of g o f when g is integrable, and f is continuous, if we add 
some extra condition for f (some kind of a reverse Lipschitzian property). 

When g : [0, 1] > R is an integrable function, the function h : [0, 1] —> R 

given by h(x) = g(x’), for all x € [0,1] is also integrable. We had 

this problem, proposed by Sorin Radulescu, in the National Mathematics 

Olympiad, as 12th grade students in 1986. 


3 


wm 


(See, for example, Daniel Daners: A Short Elementary Proof of X` 1/k? = 
x? /6, in Mathematics Magazine, 5/2012, pp 361-364, where many other good 
references are given.) This is actually not about Riemann sums (or Darboux 
sums)—but the Riemann integral appears so beautifully in the evaluation of the 
celebrated sum of the inverses of the squares of positive integers that we can’t 
miss it. 
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a) The first part is well known. Integration by parts solves it immediately: 


m/2 
An—1 — An = f cos”? x sin? x dx 
0 


1 m/2 
E 7 (cos?™”™! x) sin xdx 
n— 0 
1 ( 2n—1 5 x/2 fe 2n—1 
= cos xsinx f — cos x: cosxdx 
2n = 1 0 
1 
= ——— Ay. 
2n—1 
2n 


Note that A,_; = A, follows. Integrating by parts (twice), we can 


n— 
get the second formula. Namely, we have, 


m/2 
2 = ; 
An = x cos” x iP -f x- 2ncos™! x- (— sin x) dx 
0 
m/2 
= nf PY cos™™! x sin x dx 
0 


m/2 
= . 2 = è 
= nx’ cos”""! xsin xdx a -n f x (cos™” x—(2n—1) cos””~? x sin? x) dx 
0 


= —nB, + n(2n — 1)(Bn-1 — Bn) = n(2n — 1)By-1 — 2n’Bn. 


b) By dividing the second equation from the first part by n?A,, and by using 
2n 
57 An = An-1, 
2n— 1 


we get exactly 


1 Br-1 Bn 
—=2 Sele 
n? An-1 An 


Summing up from n = 1 ton = N we get (by telescoping) 


ye Bo By =) n? Bn 
Zn" \An An) U2 An) 
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Now we use the well-known inequality x < > sinx, valid for x € (0, 2/2) 
(it becomes an equality at the endpoints of the interval; the inequality actually 
expresses the geometric fact that the graph of the concave function x > sinx 
is situated above the chord that connects the points (0,0) and (7/2, 1)). 


Accordingly, we have 


m/2 m2 m/2 
Bn f xX cos” x dx < en Í sin? x cos” x dx 
0 0 


II 


m 2 
— f (1 — cos? x) cos” x dx 
4 Jo 


N 


T 1 
sce Ay 
4 2n+2 


2 
x 

= —(A, —A, == 
m +1) 


Putting together the last two results, we get 


m &1 n? 
0< — < 3 
gon 4(N + 1) 


n=1 


and the desired conclusion follows by making N tend to infinity. This is one 

classical approach to evaluate the value ¢(2) of the Riemann zeta function (the 

celebrated Basel problem, solved by Euler in 1735, at the age of twenty-eight). 
Note that if, instead of the integrals A, and B,, one would rather work with 


m/2 n/2 
Gr = f cos”"t! xdx and D, = i xX cos”™! xdx, 
0 0 


then one would obtain 


2 Oon? 1 Dy 
m, a 8 2 C` 


The ratio Dy/Cy can be shown to approach zero, as N goes to infinity (in the 
same manner, we proceeded with By/Ay), and hence 


oo 2 
Soins 
= aaa 8 


can be proved in this way. This is no surprise: the latter equation is well known 
to be equivalent to €(2) = 27/6. 


Chapter 16 
Antiderivatives 


An antiderivative of the function f : I C R — R is a differentiable function 
F : I — R, such that F’ = f. If F is an antiderivative of f, then F + c is also an 
antiderivative of the function f, for every real constant c. In fact, when / is an interval 
(and in most cases it is) these functions F + c,c € R, are all the antiderivatives of 
f. (This is because if F and G are two antiderivatives for the same function f on 
the interval J, then the derivative of G — F vanishes on Z; therefore the difference 
G — F must be a constant. Note the importance of the fact that J is an interval.) We 
denote by 


[resac= tr telceR) 
the set of all antiderivatives of the function f. Starting with the classical formula 
(fg) =f'3 + fe 
one obtains the formula of integration by parts, 
[ Fereear =fereco - [ fore’ Coar. 


We recall the formula of change of the variable, which can be deduced from the 
formula of derivation of composition of two functions, 


FEON = f'E) w). 


More precisely, 


f Teo ors Te aaa 
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The existence of antiderivatives of a given function cannot be taken for granted—in 
general—and we will see below examples of functions that fail to have antideriva- 
tives. However, as an immediate corollary of the fundamental theorem of calculus 
(see the previous chapter), it follows that continuous functions have antiderivatives. 
Namely, if f : J —> R (where / is an interval) is continuous, then any F defined on 
I by 


F(x) = I OEE 


(witha € J, and c € R) is an antiderivative of f. Note that, although the above 
expression of F may give the impression that we can evaluate the antiderivatives of 
any continuous function, this is not at all the case. For example, the function 


F(x) = e dt 
0 


. : : . . zye . 
is definitely an antiderivative of x > e`™* . However, there is no way to express 


this (or any other antiderivative of x b> e*) by means of elementary functions. 
Nevertheless, such expressions of the antiderivatives of continuous functions can be 
useful in various problems. 


Problem. Let f,g : I C R — R, f with antiderivatives and g € C!, that 
is, g is differentiable with continuous derivative. Prove that the function fg has 
antiderivatives. 


Solution. Let F be an antiderivative of the function f. Then by the formula 
(Fg) = fg + Fe! 
we deduce that 
fg = (Fg) — Fe’. 


The function Fg’ is continuous, so it has antiderivatives. Hence fg has antiderivatives 
as a difference of two functions with antiderivatives. O 


Problem. Let /, g : R — R be functions with the following properties: 


i) f is continuous and bijective. 
ii) g is nonnegative and has antiderivatives. 


Prove that fg also has antiderivatives. 
Solution. Let G be an antiderivative for g and let H be an antiderivative for 


G(f—!(x)). We claim that the function 


F(x) = f)G(x) — HF) 
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is an antiderivative for fg. Consider a real number xo. Let 


X— Xo 
Observe that we can also write 
S(x) = f(x): CO) Ca) + Gl) - L —fQo)  HF)- H (F (x0)) 
=x0 — Xo xX — X09 


Using the mean value theorem, we can write 


HF) -HF o) _ = H' f(e) - a —f Qo) 


x — Xo — Xo 


for some c(x) between xo and x. The choice of H gives H’(f(c(x))) = G(c(x)). In 
order to prove that S(x) tends to f(xo)g(xo) when x tends to xo, it is enough to show 
that 


G(x) — G(cQ)) 


X — Xo 


is bounded in a neighborhood of xo (we use here the continuity of f). However, this 
is not difficult, since G is increasing (because G’ = g > 0) and 


ee = Ge) | ES -69| 


X — Xo 


X — Xo 


The last quantity is bounded as x tends to xo, by the differentiability of G at xo. This 
finishes the proof. O 


Problem. Let : R — R be a continuous function such that the following limit 
exists: 


oe F f(t)dt = I(f). 


kl 


Prove that the function g defined by g(0) = J(f) and g(x) = f (4) if x Æ 0 has 
antiderivatives. 


Solution. Let F be an antiderivative for f. Then we clearly have for all x 4 0 


(¥(2)) =) -10 


Now, consider the function h defined by h(0) = 2/(f) and h(x) = 2xF (4) for 
nonzero x. The hypothesis of the problem implies that h is continuous, so it has 
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an antiderivative H. We can write g as the difference of two functions having 
antiderivatives g = H’ — U’, where U is defined by U(x) = x F(4) ifx # 0 
and U(0) = 0.0 


We have seen several conditions under which a function has antiderivatives. But 
how can we prove that a function does not have antiderivatives? The easiest and 
most practical way is the observation that any function which has antiderivatives 
also has the intermediate value property. This follows from Darboux’s theorem, 
stating that for any differentiable function F, its derivative has the intermediate 
value property. The proof of this result has been presented in Chapter 14. There are 
countless examples of situations in which this criterion works very well, but we will 
limit ourselves to one problem. Before that, note however that there are functions 
having the intermediate value property which do not have antiderivatives. We let 
the reader check that such an example is given by the function f(x) = 1 cos(+) for 
x Æ Oandf(0) = 0. 


Problem. Prove that any function f : J —> R defined on a nontrivial interval J 
and satisfying f (f (x)) = —x does not have antiderivatives. 


Solution. Suppose that f has antiderivatives. Therefore, f has the intermediate 
value property. Note however that f is injective, because if f(x) = f(y), then —x = 
fE) = fF)) = —y and so x = y. Therefore f is monotone. This implies that 
Ff (f(&)) is increasing, which is impossible, because g(x) = —x is decreasing. This 
contradiction shows that f cannot have antiderivatives. O 


Here is a more subtle problem, which needs several arguments. 


Problem. Prove that there exists no function f : [0,1] — [0,1] having an 
antiderivative F : [0, 1] — [0, 1] such that F(f(x)) = x for all x. 


Solution. Suppose, by way of contradiction, that f is such a function. Clearly, 
f is injective because f(x) = f(y) > x = F(f(x)) = F(O)) = y. Because 
f is injective and has the intermediate value property, f is strictly monotone and 
continuous. Because F’ = f > 0, F is increasing. We also deduce that f is 
increasing. 

Suppose now that f(0) > 0, thus F(f(0)) > F (0), thus F(0) < 0, a contradiction. 
Thus f(0) = 0, so 0 = F(f(0)) = F(0). Similarly, f(1) = F(1) = 1. Now take 
x € (0, 1); then Lagrange’s theorem gives c,, cz € (0, 1) such that 


F(x) — FO) = xf (c1) 
F(1) — FQ) = (1 — x)f(c2) 


Thus F(x) < x and 1 — F(x) < 1 — x, which is impossible. Therefore such a 
function cannot exist. LI 


We continue with a functional equation involving antiderivatives. 
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Problem. Let a be a nonzero real number. Find all functions f : (0,00) —> R 
having an antiderivative F such that 


FOF (<) =<. 


1 
Solution. Replacing x with —, we also obtain 
x 


(=) F(x) = ax. 
x 


Thus 


19 
siro = “rer (=) = £ E A 


This implies 


(In |F(x)|)’ = — far (2)]. 
x 


1 
so there exists a constant c such that F(x)F (=) = c. Replace this in the statement 
x 


of the problem to obtain 


Fœ) a pe 
Fs) = Or 


1 a 
for some b. Because F (<) = bx~<, we deduce that b = ,/c. Thus the solution of 
x 


the problem is f(x) = 


a_ 
xe! for some constant c > 0.0 
c 


Problem. Let f : R — R be a continuous differentiable function, strictly 
monotone, such that f (0) = 0. Prove that the function g : R —> R 


7 . 1 
rote Poia: xÆ0 
0, x=0 


has antiderivatives. 
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Solution. Let us observe that 


(Pocos 5) SOP (x) cos 7 yoa n 


which can be also written as 


Ton =(7 OF: si) ~ 2 (2)f'(x) cos 


1 
re ) FO FQ)’ 


Consider 


, i 
ma) = | BVO Foy x#0 
0, MER 


which is clearly continuous. Take H to be an antiderivative of h and consider 


G(x) = ee cos =z — H(x), xe R* l 
_H(0), x=0 


Note that G is continuous because f is and f (0) = 0. Also, 


f’ Œœ) cos —~ 
fim EO — GO) n © 


x0 x x0 X 


= lim Or ) cos s — He) -H0 


= f'(0) lim f(a) cos re — n(0) = 0. 


— H(x) + H(0) 


Thus G is differentiable and G’ = g. O 


Proposed Problems 


1. Find the integrals 
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2 
2. Let F : R > R be an antiderivative of the function f : EB = — R with 


Prove that 


3. Let f : [0,22] —> R be continuous and decreasing such that f(x) = 0. Prove 
that for every antiderivative F of the function f, we have 


2m 
f F(x) cos xdx < 0. 
0 


4. Show that if f : R — R has antiderivatives, then the same is true for h : R —> R 
defined by h(x) = |x|f (x) for all x. 
5. Prove that the function f : R — R given by the formula 


1 
_ Ísin, x40 
f@) = eC 
has antiderivatives. 
6. Let f : R — R be such that one of its antiderivatives F has the property that 
lim F(x) = 0. Prove that the function g : [0, o0) —> R given by the formula 
X7>—OCO 


_ J fdnx), x>0 
g(x) = 0. jak 
has antiderivatives. 
7. Let f : R — R be such that the functions g, h : R > R, 


g(x) = f(x) sinx, h(x) = f(x) cosx 


have antiderivatives. Prove that f has antiderivatives. 

8. For a function f : R — R and for every positive integer n, denote by fyn the 
restriction of the function f to the interval (—n, co). Decide if the following 
assertions are true: 


a) If the functions fp have the intermediate value property, for all positive 
integers n, then the function f has the intermediate value property. 

b) If the functions f, have antiderivatives, for all positive integers n, then the 
function f has antiderivatives. 
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10. 


11. 


12. 


13. 


14. 


15. 


. Find an antiderivative of the function g : (0,00) —> R, g(x) = e 
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c) Answer similar questions for a function f : [0, 00) —> R with its restrictions 
Ja on the interval [1/n, o0),n € N*. 


+2 if we 

know an antiderivative F of the function f : R > R, f(x) = e. 

Find an antiderivative of the function f : R > R,f(x) = e*, if we know an 
1 

antiderivative G of the function g : (0,00) > R, g(x) = eta, 

Let f : (~x /2, 7/2) — [0, 1] and let us define the function g : (~m /2, 1/2) > 


R, by the formula 


g(x) = f(x) tan x. 


Prove that: 


a) Iff has antiderivatives, then g has antiderivatives. 
b) If g has antiderivatives and f is continuous at zero, then f has antiderivatives. 


Are there differentiable functions f : R — (0,00) such that Inf is an 
antiderivative of ln F, for some antiderivative F of f? 
Let f : R — R be a function with antiderivatives such that 


F(x) = fF) —»), 


for all real numbers x. Prove that the function f is continuous. 
Let f : [0,œ) —> R, with f(0) = 0, continuous on (0, 00), satisfying the 
following properties: 


a) there exist positive real numbers r and M such that |f(x)| < M, for all real 
numbers x € [0, r]; 
b) there exists a decreasing sequence (x,)n>0, Converging to zero, with 


- Xn+1 
lim Z= = 1, and such that 
n> Xn 


J E TS 


for all positive integers n. Prove that the function f has antiderivatives. 
Consider the function f : [0, o0) —> R, with f(0) = 0, continuous on (0, co) 


and linear on each interval of the form | ———, — | ,n > 1 such that 
n+l n 


si) =e" 
n 


for all positive integers n. Prove that the function f has antiderivatives. 
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16. Let f be a real function with an antiderivative F on R. Prove that if 


|x| 


f(x) < +f 


for all x € R, then F has a unique fixed point (1.e., there exists precisely one 
xo € R such that F (xo) = xo). 


17. Let f,g : R > 


R be two continuously differentiable functions such that f” is 


bounded, lim,-++60 f(x)/x = 0, g(x) = 0 if and only if x = 0, and g’(0) Æ 0. 


Prove that h : R 


has antiderivativ 


— R defined by 


HESE for x £0 


0, for x= 0 


h(x) = | 


esonR. 


18. (Jarnik’s theorem) Let f, g : R — R be real functions that have antiderivatives 


and such that g 
property. 


Solutions 


is nonzero on R. Show that f/g has the intermediate value 


1. First let us compute 


fy a CEN 


1+ yx 


With the notation 


we have 


PO = (/x + 


fx) = yva +a yV- va, 


Vx—1) + (Vx - vx- 1) +2 (vE+ Vx —1)(Vx- V¥x—1) 


= 2./x+ 2, 


so f(x) = y2(1 + 


/x). It follows that 


ee ER 


EN D =A] So 
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1 
By changing the variable-———— = t, we obtain 


Jit va 
I+J= BO as 2)y1 +x +C. 


Next, 


pa [45t yva 
=f 1+ J/x 


With the notation 


POT Beat ae. 


we have 


g(x) = 2Vx—2 > ga) = V2(Vx- 1). 


Therefore 


I-J= 


[ira a at 


Jar 9 — 4,/2 arctan Ast 


. By the Leibniz-Newton formula, 


27 x A a 
1=F(|—)-F(—)= — d 
(F) (5) [. sinx 


then with x = x — t, it follows that 


27/3 7 _ ¢ 27/3 4 x 
t= f - a= f — dx. 
x/3 sint z/3 sinx 


(Note the general useful result that hides behind, namely, the formula 


/ ' fQdx = i ; flat b—x)dx. 
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We mentioned this in Chapter 1, too.) Hence 


Ge ae xX\ 27/3 
2I= i —dx =a In (tan >) 
z/3 sinx 2 


z/3 


= m ln3. 


3. For x € [0, z], f(x) > 0 and for x € [, 27] , f(x) < 0. Thus f(x) sinx > 0, for 
all x € [0, 27]. By integrating by parts, we have 


27 27 
f F(x) cos xdx | F(x)(sin x)'dx 
0 0 


2x 2r 
= F(x)sin xl = f(x) sin xdx 
0 0 


2n 
=—-| f(x)sinxdx < 0. 
0 


4. Let F be an antiderivative of f; thus F’ = f on R. Because 


J af (x)dx = J xF' (x)dx = xF (x) — J F(x)dx 


the idea is to consider an antiderivative G of F (F is differentiable, thus 
continuous, thus it has antiderivatives), then note that x œ> xF(x) — G(x) is 
an antiderivative of x > xf (x). Finally glue together two such antiderivatives 
of x  —xf(x) (for x < 0) and of x > xf (x) (for x > 0) in order to obtain a 
continuous function. Namely, consider H : R > R defined by 


—xF (x) + G(x), for x <0 


HQ) = xF(x)— G(x) + 2G(0), for x> 0. 


We have H'(x) = |x|f(x) for all x Æ 0, and, by the corollary of Lagrange’s 
theorem, the differentiability of H at the origin follows; moreover, 


H'(0) = lim H' (x) = lim h(x) = h(0) 


follows; therefore H is an antiderivative of h on R. 
5. Let us define the function 


Deal 
x cosz, x #0 


A(x) = 
We 6 x=0 


For every nonzero real x, we have 


1 
h'(x) = 2xcos — + sin — 
x x 
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and 


. hœ- hO) _ 1 
lim —————— = lim | xcos — | = 0. 
x—>0 x—0 x—>0 X 


Hence the function A is differentiable with 


1 TE 
h'(x) = 2xcos + sin z, x40 
0, x=0 
Moreover, if we denote 
ee ~ x#0 
: x=0°’ 


then u is continuous, and 
h(x) = u(x) +f). 


Hence f(x) = W (x)—u(x) has antiderivatives, as the difference of two functions 
with antiderivatives. Observe that this is a special case of a problem discussed 
in the theoretical part. 


. Let us define the function A : [0,00) —> R by the formula 


xF(Inx), x>0 


h(x) = 0, sig 


For x > 0, we have 
h'(x) = F(nx) + fan x). 


Also 


h h(O 
lim on L = lim F(lnx) = lim F(y) =0, 
x—> y>—00 


x0 x- 


so A is differentiable with 


hæ = ee + f(inx), nae 
If u : [0, œ) > R, 
wey = [Anon x20 
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then u is continuous, and h'(x) = u(x) + g(x). From here, 


g(x) = h(x) — ua), 


so the function g has antiderivatives, as a difference of two functions that have 
antiderivatives. 

7. We will use the fact that the product of a C!-function and a function with 
antiderivatives is also a function with antiderivatives. Here, g(x) has antideriva- 
tives, and the functions sin and cos are C!, so the functions g(x) sinx and 
h(x) cos x have antiderivatives. We have 


g(x) sinx = f(x) sin? x, h(x) cosx = f(x) cos” x. 
Their sum also has antiderivatives, 
f(x) sin? x + f(x) cos? x = f(x). 


8. a) True. For a < b, we prove that f(/) is an interval, where J = (a,b). For 
c < a,c <0,f has the intermediate value property on (c, 00), so f(/) is an 
interval. 

b) True. If F, is the antiderivative of fı with F,(1) = 0, then the function 
F : R > R given by the formula 


F(x) = F,(x), x € (—n, oo) 


is well defined, and it is an antiderivative of the function f. 
c) False. One example is the function f : [0,0o) — R given by the formula 


sint, x € (0,00) l 


fo = [E x=0 


so 


[r (=+ 5) =e (: = =) ore: (16.1) 
X x 


Then 
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[F( E 5 =e? (1 + =) th, (16.2) 
x X 


From relations (16.1) and (16.2), we obtain 
ea 1 
e? [r (=+ | = (1 E >) eta 
x x 
1\ T 1 
FeDe 
x x 


and by adding, we deduce that 


att meele (x4 nj- efr (x- SJE 


Hence an antiderivative of the function g is 


G(x) = ; [r (=+ 5) + eF (x- 3) : 


and 


10. We have 


so 


By integration, 


1 
With the notation x — — = y, we have 
x 


ee. aa, 
== 
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and so 


reef) -of SP] 


2 


11. a) The function f has antiderivatives, and the tangent function (defined 
on (—2/2,2/2)) is differentiable, with continuous derivative, so g has 
antiderivatives. 

b) We have 


F(x) = g(x) cotx, 


for all x € (—/2,0) U (0, 2/2), so f has antiderivatives on the intervals 
(—z/2,0) and (0, 2/2), as the product between a function with antideriva- 
tives and a C!-function. Let F; , Fz be antiderivatives of the function f on the 
intervals (—2/2,0) and (0, 1/2) , respectively. The functions F; and F> are 
monotone, so there exist the limits 


lim Fi (x) = l, lim F(x) = L. 


These limits are finite, because according to Lagrange’s theorem, 


T 


lFi@—Fi (-5)| = en (x+ 2)] <5 


and similarly for F2. Now let us define the function F : (-1/2, 2/2) > R, by 
Fix) — l, x € (-2/2,0) 


F(x) = 4 0, x=0 
Fy(x) — b, xE (0, w/2) 


Then F is continuous on (—2/2, 2/2) and differentiable on (—2/2, z /2)\ {0}, 
with F’(x) = f(x). Thus, F is an antiderivative of the function f. 
12. The answer is yes. One example is 


f(x) = el Tte 


We have 


(Inf)’ = In F - =]nF > f'=flnF 
or 


f =(FilnF—FY. 
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Therefore the functions f and F In F — F differ by a constant. We try to find a 
function for which f = F In F — F or 


= sa 
FinF—F 


’ 


if we are lucky to find a function for which the denominator is not zero. Then 
[G(F(x))]/ = 1, where G is an antiderivative of 


1 


xlnx— x 


Let us put y = lnx, then 


/ cas =| a =| ay = In(nx—1)+C. 
xInx—x x(Inx — 1) y-1 


Thus 
Inin F(x) — 1) =x > InF(x) —1 =e > F(x) = e't”, 


so f (x) = eltt, 

Let us define the function g : R —> R by the formula g(x) = f(x) — x. The 
function g has antiderivatives, as the difference of two functions which have 
antiderivatives. We have 


88E) = sf) — x) = fF) — x) — Fa) —x) = x, 


so gog = 1g. From this it follows that g is injective. But any injective function 
with antiderivatives is continuous. 

Finally, the function f(x) = x + g(x) is continuous, as the sum of two 
continuous functions. 
We will prove that the function F : [0, co) > R given by the formula 


Hy [ soa x> 00 
0 


> x=0 


for a fixed xọ > 0 is an antiderivative of the function f, i.e., F’ = f. For x > 0, 


P= ( Í Soas) SA 
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15. 


16. 


Let now (¥n)n>0 be a sequence convergent to 0 and let k, be such that y, € 
(Xz, +1, Xk, ] for all n. Then the hypothesis implies 


/ “fd = / fodit Ea / sy Ca [roa 


Xkn—1 


sii (= 7 i) 
Xk +1 


This last quantity being convergent to 0, it follows that F is differentiable at 0 
and F’(0) = f(0) = 0. This finishes the proof. 


so 


eee 
Yn 


1 
The problem follows from the previous one, by taking x, = nET n eN. 
n 


1 1 
Being linear on each interval Fes: J with 
n+l n 


s), sE) cn 


1/n 
I fat = 0, 
1 


/(n+1) 


it follows that 


for all nonnegative integers n. Now we are under the hypothesis of the previous 
problem. 
The relation 


1 
1+x 


X 
ross! 


for all x > 0 can also be expressed by saying that the derivative of the function 
G(x) = F(x) — x + lIn(1 + x) is nonpositive on [0, 00). Thus G decreases on 
[0, co), consequently, G(x) < G(0) for all x > 0, that is, 


F(x) —x < F(0)— ln(1 + x) 
for x > 0. Letting x go to infinity, we get lim (F(x) — x) = —oo. 
roO 
Similarly, we have 


1 


l—x 


IOs- =1- 
1-x 
for all x < 0, which yields the nonpositivity of the derivative of 


A(x) = F(x) —x—In(1 — x), 


306 


17. 


16 Antiderivatives 
then 
F(x) —x => F(0) + ln(1 — x) 


for x < 0, and, further, „im (F (x) -x)= œ. 

Now, x +> F(x) — x is a continuous function whose limits at +00 have 
opposite signs. By the intermediate value theorem, there must be a point at 
which this function has zero value, that is, there exists a fixed point for F. 

However, two such points cannot exist. For, if we assumed that a Æ b, 
F(a) = a, and F(b) = b, then we would get the contradiction 


_ FO) =F) lel 


ie = Oa a 


& 1+ [c] < lel 


by using Lagrange’s mean value theorem (c is the intermediate point between a 

and b whose existence is assured by the theorem). (Actually, the condition for f 

shows that F is a contraction; hence it cannot have more than one fixed point.) 
This is a problem proposed by Sorin Rădulescu for the Romanian Mathe- 

matical Olympiad in the year 1986. (Yes, we were there.) 

Let A, b, c be defined by 


A(x) = | Lo (4) , for x40 
0, for x= 0 
b@) = | 2e(ag’(nif (fy). for x #0 
0, for x =0 
and 
c(x) = | HORELO) (5) for x #0 
0, for x= 0. 


One immediately sees that A’(x) = b(x) — c(x) — g'(0)A(x) for all x 4 0. 
Because 


1). fU/e@) o FO 
tim eta (25) = tim eG) eee 


and g(x)/x has a finite limit (g’(0)) at the origin, we also have 


A' (0) = lim Be) = lim g(x) -ef (=) = 0; 
x>0 x x>0 x g(x) 


0 
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hence the equality A’ (x) = b(x) — c(x) — g’(0)A(x) actually holds for all x € R. 
Because (as we have just seen) lim g(x)f(1/g(x)) = 0 and because 


lim g'(x) = g' (0) is finite, the continuity of b at the origin follows. Similarly, 
since lim (g’ (x) — g’(0)) = 0 and f’ is bounded, c is continuous at the origin. 


Also, the fact that b and c are continuous at nonzero points is a consequence 
of the conditions from the hypothesis—thus b and c are continuous functions 
on R. Being continuous, they surely have antiderivatives, while A’ obviously 
has antiderivative A. All the above facts yield the conclusion that 


1 

h = Zo” c—A) 
has antiderivatives, too, which finishes the proof. 

This is a problem proposed by Ion Chitescu for the Romanian Mathematical 
Olympiad in the year 1990. (We were not there anymore.) 
Let a < b be reals such that f(a)/g(a) and f(b)/g(b) are distinct and let y 
be between f(a)/g(a) and f(b)/g(b). Note that g has the intermediate value 
property, because it has antiderivatives. Since g is nonzero, it must have a 
constant sign on R. It follows that g(a) and g(b) have the same sign; hence 
f (a)—yg(a) and f(b)—yg(b) have opposite signs. Thus for the function h defined 
by A(t) = f(t) — yg (t), for t € [a, b], we have h(a)h(b) < 0. On the other hand, 
because f and g both have antiderivatives, h has also antiderivatives; therefore h 
has the intermediate value property. Consequently, there exists x € (a,b) such 
that h(x) = 0, meaning that (f/g)(x) = y. 
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